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M.R.  Leadbetter 
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The  purpose  of  this  paper  is  to  provide  an  overview  of  the  asymptotic 
distributional  theory  of  extreme  values  for  a  wide  class  of  dependent 
stochastic  sequences  and  continuous  parameter  processes.  The  theory  contains 
the  standard  classical  extreme  value  results  for  maxima  and  extreme  order 
statistics  as  special  cases  but  is  richer  on  account  of  the  diverse  behavior 
possible  under  dependence  in  both  discrete  and  continuous  time  contexts. 
Emphasis  is  placed  on  stationary  cases  but  other  inportant  classes  (e.g. 
Markov  sequences)  are  included.  Significant  ideas  and  methods  are  described 
rather  than  details,  and  in  particular  the  nature  and  role  of  important 
underlying  point  processes  (such  as  exceedances  and  upcrossings)  are 
emphasized.  Applications  are  given  to  particular  classes  of  processes  (e.g. 
normal,  moving  average)  and  connections  with  related  theory  (such  as 
convergence  of  sums)  are  indicated. 
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The  purpose  of  this  paper  is  to  give  a  "motivated  overview"  of  the 
principal  results  in  and  related  to  the  distributional  theory  of  extremes  of 
stationary  sequences  and  processes.  In  particular  we  shall  be  concerned  with 
distributional  properties  of  the  maximum  =  max( £ £ 2' *  ’  * '  ^  anc*  ot*ier 
order  statistics  from  stationary  sequences  { \  as  n  a  oo  and  with 
corresponding  results  for  continuous  parameter  processes.  The  emphasis 
throughout  will  be  on  the  motivation  for  and  significant  methods  used  in 
obtaining  the  results.  Full  proofs  will  not  generally  be  given  -  in  many 
cases  the  details  of  such  proofs  may  be  found  in  the  volume  [55],  or  from  the 
references  cited. 

The  results  to  be  described  may,  in  part,  be  regarded  as  extensions  of 
the  classical  theory  of  extremes  of  sequences  of  independent,  identically 
distributed  (i.i.d.)  random  variables  (r.v.’s).  However,  they  constitute  more 
than  just  such  an  extension  of  the  classical  theory,  since  the  dependent 
framework  provides  a  natural  setting  for  the  theory  and  one  in  which  its 


essential  ideas  and  methods  may  be  clearly  exposed.  In  particular,  it  will  be 
seen  that  the  central  results  may  often  be  regarded  as  special  cases  of  the 


convergence  of  certain  point  processes  -  a  view  which  may  of  course  be  taken 
in  the  classical  case  but  which  is  less  needed  there  in  view  of  the  detailed 


/TV 


i.i.d.  assumptions.  Our  discussion  will  emphasize  the  centrality  of  these 
underlying  point  process  convergence  results. 

As  indicated  in  the  list  of  contents,  this  paper  is  organized  in  three 
main  parts.  This  first  introductory  part  contains  central  distributional 
results  of  the  classical  i.i.d.  theory  and,  in  particular,  the  "Extremal  Types 
Theorem"  which  restricts  the  possible  limiting  distributions  for  maxima  to 
essentially  three  "different  types".  We  shall  indicate  the  general 
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organization  and  main  features  of  the  most  recently  available  derivations  of 
these  results,  using  the  simple  and  elegant  approach  of  de  Haan  via  inverse 
functions.  As  can  be  seen  even  this  i.i.d.  theory  becomes  most  natural  and 
transparent  when  viewed  from  the  standpoint  of  the  behavior  of  related  point 
processes  -  such  as  the  exceedances  of  high  levels. 

The  second  part  of  the  paper  concerns  extremes  of  sequences  - 
primarily  (but  not  always)  assumed  stationary  and  is  largely  based  on  point 
process  methods.  It  will  be  seen  that  the  classical  theory  may  be  regarded  as 
a  special  case  of  the  more  general  theory  for  dependent  sequences  -  some 
results  being  identical  and  others  generalizing  in  interesting  and  non-trivial 
ways.  For  example,  it  will  be  seen,  under  weak  dependence  restrictions,  that 
the  general  "type"  of  limiting  distribution  for  the  maximum  is  the  same  as  for 
an  i.i.d.  sequence  with  the  same  marginal  d.f.  (though  the  normalizing 
constants  may  change) .  However,  the  limiting  distributions  for  other  order 
statistics  can  be  quite  different  from  those  under  i.i.d.  assumptions . 

Some  particular  cases  of  special  interest  (e.g.  normal  sequences, 
moving  averages,  Markov  sequences)  will  be  discussed  in  Part  2.  Other  aspects 
of  the  theory  (e.g.  rates  of  convergence,  multivariate  extremes)  are  also 
briefly  described  along  with  some  interesting  connections  with  convergence  of 
sums. 

In  Part  3  attention  is  turned  to  continuous  parameter  processes.  The 
theory  here  may  be  made  to  rest  on  the  sequence  case  by  the  simple  device  of 
regarding  the  maximum  of  a  process  £  (t)  up  to,  say,  time  T  =  n  as  the  maximum 
of  the  values  of  the  sequence  /  .  =  sup{  £  (t) :  i-l<tsi),  for  l<i<n.  While 
this  is  simple  and  obvious  in  principle,  the  details  are  more  complicated  and 
require  analogous  but  somewhat  more  intricate  assumptions  regarding  the 
dependence  structure  of  the  process.  The  point  process  approach  is  also  very 


The  central  result  of  classical  extreme  value  theory,  due  in  varying 
degrees  of  generality  to  Frechet  [35],  Fisher  and  Tippett  [34],  and  Gnedenko 
[37],  restricts  the  class  of  possible  limiting  d.f's  G  in  (1.2.1)  to 
essentially  three  different  types  as  follows. 

Theorem  1.2.1  (Extremal  Types  Theorem^ ■  Let  =  max  (  £ £ 2 > • • • ,  £n>  where 

£i  are  i.i.d.  If  (1.2.1)  holds  for  same  constants  a  >  0,  b  and  some  non 

n  n 

degenerate  G,  then  G  must  have  one  of  the  following  forms  (in  which  x  may  be 
replaced  by  ax  +  b  for  any  a  >  0,  b) : 

Type  I:  G(x)  =  exp(-e-x)  -<»  <  x  <  a> 

[0  x  <  0,) 

Type  II:  G(x)  = 

lexp(-x~a),  for  some  a  >  0  x  >  0  ) 

(exp(-(-x)a) ,  for  some  a  >  0  x  <  0,) 

Type  III:  G(x)  = 

ll,  x  >  O.j 

Conversely  any  such  d.f.  G  may  appear  as  a  limit  in  (1.2.1)  and  in  fact  does 
so  when  G  is  itself  the  d.f.  of  each 

It  will  be  convenient  to  say  that  two  non-degenerate  d.f. 's  G1  and 

are  of  the  same  type  if  G^(x)  =  G2(ax  +  b)  for  same  a  >  0,  b,  and  to  refer 

to  the  equivalence  classes  so  determined  as  "types".  The  use  of  "type"  in  the 

above  theorem  is  a  slight  abuse  of  this  since  Types  II  and  III  really 

represent  families  of  types — one  corresponding  to  each  a  >  0.  However  this 

abuse  is  convenient  and  it  is  conventional  to  refer  to  "the  three  types"  of 

limit.  It  should  also  be  noted  that  the  three  types  may  be  incorporated  into 

a  single  family,  for  example  by  writing  Ga  (x)  =  exp{-(l-ax)  1/a) ,  -o<a<®, 

ax  <  1,  Gq  being  interpreted  as  lim  Ga  (x)  =  exp(-e~x) .  (Such  a  para- 

a-»  0 

metrization  was  introduced  by  von  Mises) . 

Before  indicating  the  main  features  of  the  proof  of  this  theorem,  it 


will  be  convenient  to  state  the  widely  used  result  of  Khintchine  which  enters 
extremal  theory  here  and  elsewhere  in  significant  ways. 

lemma  1.2.2  (Khintchine^ ,  Let  {F^}  be  a  sequence  of  d.f . 's  and  G  a  non¬ 
degenerate  d.f.  Let  a  >  0,  b  be  constants  such  that  F  (a  x  +  b  )  -  G(x) . 

n  n  1  n  n  n  '  ' 

Then  for  some  nondegenerate  d.f.  G.  and  constants  a  >  0.B  , 

*  n  '  n' 

Fn^anx+Bn^  ^  G*M  if  ^  only  if  an  1  an  *  a  31x1  ^"^n  “  bn)  ^  b  for  same 
a  >  0  and  b,  and  then  G^(x)  =  G(ax+b)  so  that  G  and  G^  are  of  the  same  type. 

The  derivation  of  Theorem  1.2.1  can  be  conveniently  divided  into  two 
parts  and  this  division  is  most  useful  for  later  forms  of  the  result.  The 
first  part  is  to  show  that  the  class  of  limit  laws  G  in  (1.2.1)  is  precisely 
the  class  of  max-stable  d.f. 's.  Specifically  a  d.f.  G  is  called  max- 
stable  if  for  each  n=l,2,...  the  d.f.  (f1  is  of  the  same  type  as  G,  i.e.  if 
there  exist  constants  an  >  0,bn  such  that  G^^x  +  bn)  =  G(x) .  The  following 
lemma — which  follows  readily  using  Lemma  1.2.2 — gives  an  immediate  proof  of 
this  part. 

1-2.3.  A  non-degenerate  d.f.  G  is  max-stable  if  and  only  if  there  is  a 

sequence  (F  }  of  d.f. 's,  and  constants  a  >  0,b  such  that 
n  n  n 

-1  ^  l/k 

(1.2.2)  Fn(anjcx  +  ■*  G  '  (x)  as  mo  for  each  k=l,2... 

It  is  easily  seen  from  this  that  any  G  satisfying  (1.2.1)  is  max- 
stable  by  simply  identifying  Fn  with  F1^  where  F  is  the  d.f.  of  each  ^  For 

_l  d 

(1.2.1)  is  just  Fr‘(an  x+bn)-»  G(x)  and  replacing  n  by  nk  yields  (1.2.2)  at 

once.  The  converse  is  even  simpler  since  if  G  is  max  stable  and 
n  -*2. 

G  (an  x+b^)=  G(x)  for  some  an  >  0,  bn,  an  i.i.d.  sequence  with  common  d.f.  G 


satisfies 
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P(dn(Mn  -  bn)  <  x}  =  Gn(a^-x  +  bn)  =  G(x) 
so  that  (1.2.1)  holds  trivially  and  G  is  indeed  a  limit  law  for  maxima. 

The  essential  point  of  this  argument  is  that  if  Fn  is  the  d.f.  of  Mn> 
then  the  assumption  (1.2.1)  is  the  same  as  (1.2.2)  with  k  =  1  and  the 
independence  shows  that  if  (1.2.2)  holds  with  k  =  1,  it  holds  with  k  =  2,3,... 
so  that  G  is  max-stable.  It  will  be  seen  later  that  the  same  line  of  argument 
holds  in  dependent  cases  though  the  details  are  more  complicated  since  F^  is 
no  longer  precisely  F*\ 

Thus  the  class  of  non-degenerate  limit  laws  for  maxima  is  precisely 
the  class  of  max-stable  d.f. 's.  The  other  part  to  the  proof  of  Theorem  1.2.1 
is  to  identify  the  class  of  max-stable  d.f.  's  with  the  Type  I,  II  and  III 
extreme  value  d.f.'s.  This  is  a  purely  function-analytic  (non-probabilistic) 
procedure  and  will  apply  verbatim  in  dependent  cases.  It  is  in  fact  trivial 
to  show  that  each  extreme  value  d.f.  is  max-stable.  The  more  important 
converse  is  readily  shown  by  transforming  the  definition  of  max-stability  of  G 
to  give  a  functional  equation  for  the  function  U(y)  inverse  to 
-log(-log  G(x) ) ,  which  may  be  readily  solved  (cf.  [38])  to  show  that  G  must  be 
one  of  the  three  extreme  value  types. 

It  is,  of  course,  important  to  know  which  (if  any)  of  the  three  types 
of  limit  law  applies  when  {  has  a  given  d.f.  F.  Necessary  and  sufficient 
conditions  are  known,  involving  only  the  behavior  of  the  tail  l-F(x)  as  x 
increases  for  each  possible  limit.  One  form  of  such  domain  of  attraction 
criteria  is  as  follows.  In  this  Xp  (<  ®)  will  denote  the  right  hand  end 
point  sup{t:  F(t)  <  1}  of  the  d.f.  F. 

Theorem  1.2.4.  Necessary  and  sufficient  conditions  for  the  d.f.  F  of  the 
r.v. ’s  of  the  i.i.d.  sequence  {  f  }  to  lead  to  each  of  the  three  types  are: 


/  .p  /v  .  -j.  *  .  •  .  *- '  +  \m  *  *.•  .* 


/.  • 

s, 
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Type  I:  There  exists  some  strictly  positive  function  g(t)  such  that 
lira  (l-F(t+xg(t) ) )/(l-F(t) )  =  e~x  for  all  real  x; 

t^Xp 

Type  II:  Xp  =  co  and 

lim  (l-F(tx) )/ (l-F(t) )  =  x_ct,  a  >  0,  for  each  x  >  0; 

t->  CO 

Type  III:  Xp  <  <x>  and 

lim  (l-F(x--xh)  )/(l-F(x--h) )  =  x* ,  a>0,  for  each  x  >  0. 
hi  0  T  T 

The  sufficiency  of  these  conditions  is  readily  established  (cf.  [55]). 
The  necessity  is  more  complicated  (though  perhaps  also  less  important)  but  may 
be  achieved  by  using  methods  of  regular  variation  (cf.  [39]  for  a  recent 
smooth  treatment) . 

The  following  simple  result  is  also  used  in  these  "domain  of 
attraction"  determinations. 

Theorem  1.2.5.  Let  {un,  n  >  1)  be  constants  and  0  <  r  <  co.  If  £ l 2, . . . 
are  i.i.d.  with  d.f.  F  then 

(1.2.3)  P{MnSUn}  *  e~  T 
if  and  only  if 

(1.2.4)  n(l  -  F(un))  ■*  t. 

This  result  is  proved  almost  trivially  by  writing  P{Mn<un)  =  r(un) 

=  (1-(1-F(u  ) ) )n  in  this  i.i.d.  context.  It  is  stated  formally  since  its 
generalizations  to  dependent  cases  are  important  and  much  less  trivial  in 
proof.  It  may  be  noted  that  (1.2.1)  is  a  special  case  of  (1.2.3)  using  a 
linear  parametrization,  by  making  identifications  r  =  -log  G(x) ,  un=a~1x+br). 
Thus  a  necessary  and  sufficient  condition  for  the  limit  G  is 
n(l  -  F(an1x  +  b  ))  ->  -log  G(x) ,  as  mo, 
for  each  x,  and  some  a  >  0,  b  .  This  explains  the  relevance  of  the  tail 


1  -  F(x)  in  Theorem  1.2.4  which  examines  the  existence  of  such  a  ,b  for  each 

n  n 

of  the  cases  -log  G(x)  =  e-x,  x_a,  (-x)a. 

By  applying  this  result,  forms  for  the  normalizing  constants  may  also 
be  obtained.  Specifically  if  7n  is  defined  to  satisfy  F(7  -)  2  l-n 


b  nay  be  taken 
n  J 

in  each  case 
a 

n 

to  be: 

-1 

b 

n 

Type  I: 

(g(V) 

-1 

7 

n 

Type  II: 

7  ± 

n 

-1 

0 

Type  III: 

<*F  -  V 

I 

*F 

(using  the  notation  of  Theorem  1.2.4).  Of  course  while  the  "(l-n  1)- 
percentile  7^  may  be  determined  (and  hence  an,bn  found)  when  F  is  known,  the 
practical  problem  lies  in  the  estimation  of  those  constants  when  the  form  of  F 
is  not  precisely  known. 

It  is  readily  checked  that  a  standard  normal  sequence  belongs  to  the 
Type  I  domain  with  normalizing  constants 

(1.2.5)  an=(2  log  n)1/2 

bn=(2  log  n) 1//2  -  ~  (2  log  n)  1//2(log  log  n  +  log  4 7T )  . 

The  exponential  and  log  normal  distributions  also  have  Type  I  limits  as  does 
the  d.f.  F(x)  =  l-e1//x  (x  <  0)  with  a  finite  right  endpoint  Xp  =  0. 

The  Pareto  and  Cauchy  distributions  give  Type  II  limits  whereas  the 
uniform  distribution  belongs  to  the  Type  III  domain. 

It  should  be  noted  that  not  every  d.f.  F  belongs  to  a  domain  of 
attraction  at  all.  The  most  common  case  occurs  for  certain  discrete 
distributions — such  as  the  Poisson  and  geometric  distributions — for  which 
there  is  no  sequence  (un)  such  that  (1.2.4)  holds.  This  typically  happens  in 
cases  when  the  jumps  of  the  d.f.  do  not  decay  sufficiently  quickly  relative  to 


mm 


the  tail.  In  fact  for  a  given  r  (0<T<a) ,  a  sequence  (un)  satisfying  (1.2.4) 
may  be  found  if  and  only  if 


C.-C.' 

e  >ju. 

I’KS 


(1.2.6)  (F(x)-F(x-) )/(l-F(x) )-»  0  asx  f  Xp 

It  is  readily  checked  that  (1.2.6)  fails  for  the  Poisson  and  geometric 


cases  and  hence  there  can  be  no  un  of  any  form  (and  certainly  not  of  the  form 
a^x  +  b  )  satisfying  (1.2.4)  with  r  =  -log  G(x),  so  that  no  limiting 
distribution  exists.  However  it  is  also  possible  for  there  to  be  no  limit 

even  if  (1.2.6)  is  satisfied — indeed  for  certain  continuous  d.f. 's.  A  case  in 

•  •  -x-sin  x 

point  is  the  d.f.  F(x)  =  1-e  ,  an  example  due  to  Von  Mises. 

We  turn  now,  in  this  brief  tour  of  classical  results,  to  other  extreme 
order  statistics,  writing  for  the  largest  among  the  i.i.d.  ,£n 

with  common  d.f.  F.  Suppose  that  (1.2.6)  holds  and  hence  for  any  fixed  t>0, 
u^=  un(T)  roay  be  found  satisfying  (1.2.4).  let  Sn  denote  the  number  of 
exceedances  of  u  by  ?n,...,J  ,  i.e.  S  is  the  number  of  i,  l<i<n,  such 
that  (v  >  u  .  Clearly  for  k  =  1,2, ... , 

,(k) 


(1.2.7) 


P{M 


n 


<  un)  =  P{Sn  <  k) 


since  the  events  in  brackets  are  identical.  But  S  is  binomial  with 

n 

parameters  (n,p  )  ,  p^=  1  -  F(u  ) ,  np^  t  t  so  that  Sn  has  a  Poisson  limit  with 


mean  r  and  hence 

(1.2.8) 


P{M^  <  u  )  -» 
1  n  n; 


k-i 

1  r  /r! 
r^O 


Suppose  now  that  Mn  =  max(?  $2, . . . ,  f  n)  has  limiting  distribution  G  so  that 
(1.2.1)  holds.  By  the  standard  identification  un  =  a^  1x+b^,  T  =  -log  G(x)  we  see 
from  Theorem  1.2.5  that  (1.2.4)  holds  and  hence  from  (1.2.8)  that 


p'an<Hnk)  - 


k_1  s 

b  )  £  x)  -»  G (x)  l  (“log  G(x) )  /si 
n  s=0 


IVi 


tm 

"  **„*’  % 

■>v> 


m 

--V-J 

•  : 

..  v  k  1 


(1.2.9) 


Thus  if  the  maximum  has  a  limiting  distribution  G,  then  the  k  largest 

has  a  limiting  distribution  given  by  (1.2.9)  (with  the  same  normalizing 

constants  a  ,b  as  the  maximum  itself) . 
n  n 

These  results  foreshadow  a  more  detailed  discussion  of  the  exceedances 
and  related  point  processes,  which  will  be  taken  up  in  the  next  section. 

Finally,  topics  from  the  classical  theory  not  dealt  with  in  this 
present  part  include  (a)  rate  of  convergence  results  (considered  in  the 
dependent  setting  in  Section  2.8,  (b)  asymptotic  distributions  of  minima 
(obtainable  by  simple  transformations  of  the  results  for  maxima) ,  and  (c) 
asymptotic  theory  of  variable  rank  order  statistics  (cf.  [85]). 

1.3  Point  processes  associated  with  extremes 

The  heart  of  the  previous  simple  calculation  leading  to  (1.2.9)  is 
that  if  n(l-F(un))  -»  r  then  the  number  of  exceedances  of  by  ?1***^n  is 
asymptotically  Poisson  with  mean  r .  This  simple  observation  is  capable  of 
considerable,  useful  generalization  both  for  the  present  i.i.d.  and  for 
dependent  cases.  The  simplest  of  these  results  concerns  the  point  process 
N  of  exceedances  of  the  level  u  .  Specifically  N  consists  of  the  point 
process  on  (0,1]  formed  by  normalizing  the  actual  exceedance  points  by  the 
factor  1/n  i.e.  if  i  is  the  time  of  an  exceedance  (^  >  un)  then  a  point  of 
Nn  is  plotted  at  i/n.  If  E  c  (0,1]  then  Nn(E)  denotes  the  number  of  such 
points  in  E,  so  that  Nn(E)=#{i/n  e  E:  >  u^,  lsisn)  =  #{i  e  nE:  >  un, 

l<i<n).  The  actual  exceedance  points  and  the  point  process  Nn  are 
illustrated  in  Figure  1.3.1  below. 

E, 

Ur* 


0  1 


Figure  1.3.1  Point  process  of  exceedances 
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One  of  the  central  results  is  that  the  point  process  Nn  takes  on  an 
asymptotic  Poisson  character  as  n  increases  in  the  sense  that  Nn  converges  in 
distribution  to  a  Poisson  process  N  as  n  •>  o  (Theorem  1.3.1  below).  A  brief 
discussion  of  relevant  features  of  point  processes  and  their  properties  will 
be  given  in  Section  2.3  but  for  now  it  is  sufficient  to  note  that  convergence 
in  distribution  of  point  processes  may  be  expressed  in  terms  of  convergence  of 
finite  dimensional  distributions;  specifically  Nn  $  N  if  N({a})=0  for  each  a  and 
(1.3.1)  (Nn(a1,b1],  Nn(a2/b2]...Nn(ak,bk])  3  (Nfa^b-J,  N(a2,b2] , . .  .N(ak,bk] ) 

for  each  choice  of  k  and  subintervals  (a^,b^]  c  (0,1]. 

Theorem  1.3.1  Let  ...  be  i.i.d.  with  common  d.f.  F  and  let  (un)  satisfy 

(1.2.4) .  Then  Nn  ^  N,  where  N  is  a  Poisson  process  on  (0,1]  with  intensity  r. 

This  result  is  almost  self evident  by  virtue  of  the  criterion  (1.3.1) . 

For  example  Nn((a,b])  is  the  number  of  i  e  (na,nb]  for  which  >  ufi  and 
hence  is  binomial  with  parameters  ([nb]-[na] ) ,  1-F(un))  and  converges  to  the 
Poisson  r.v.  N((a,b])  having  mean  r (b-a)  since  ([nb]-[na] ) (l-F(un) )->  t  (b-a)  by 
(1.2.4).  The  more  general  statement  (1.3.1)  clearly  follows  by  independence 
if  the  intervals  (a^,b^]  are  disjoint,  and  in  general  by  considering  the 
overlaps  between  the  intervals  and  thereby  reducing  the  problem  to  the  case  of 
disjoint  intervals  in  an  obvious  way. 

Theorem  1.3.1  clearly  includes  (1.2.7)  and  hence  (1.2.9)  since 
N  ( (0,1]) =£>n  so  that  it  may  be  regarded  as  a  "fountainhead"  result  from  which 
the  asymptotic  distributions  for  the  maximum  and  all  extreme  order  statistics 
follow.  The  result  may  be  extended  by  considering  more  than  one  leve’  to  give 
in  particular  asymptotic  joint  distributions  of  finite  numbers  of  order 
statistics.  Specifically  let  0  <  r1  <  r 2 ...<  rr  be  fixed  constants  and 
{ un ( t ) )  such  that 


-  A\ 
■A  . 
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(1.3.2)  n(l  -  F(un(r))  ->  T 

for  each  r>  0,  where  un(T)  are  taken  so  that  (  t  . )  -  un^T2^  ‘  *  *"un^Tr^  * 

Consider  exceedances  of  the  levels  un(Ti) / • • • /un(Tr)  33  illustrated  in  Fig. 
1.3.2(a)  (again  normalizing  the  time  scale  by  1/n) .  A  vector  point  process  is 
thus  obtained  and  may  be  visualized  by  points  on  fixed  lines  in 

the  plane  as  shown  in  Fig.  1.3.2(b) .  Denoting  these  individual  point  processes  by 


0  (b) 


Figure  1.3.2  (a)  Levels  and  values  of  { £  )  (b)  Representation  on  fixed  lines 
L^L L1,L2,...,Ls 

,  1  <  k  <  r,  it  is  clear  that  each  is  asymptotically  Poisson  with 
intensity  and  that  is  a  "thinned"  version  of  for  2<k<r. 

In  fact  in  the  limit  the  thinnings  involve  independent  removal  of  events. 

To  see  this  more  explicitly  let  Nn  be  the  process  defined  on  the  plane  by 
the  points  of  l<k<r  (i.e.  confined  to  the  lines  1^,1^, ...  ,1^.)  and 
let  N  be  a  point  process  in  the  plane  defined  as  follows: 

Let  (o’-Lj?  1=1,2,...}  be  the  points  of  a  Poisson  Process  Nv  1  with 
parameter  on  Lr  and  let  {6^,  j=l,2,...}  be  i.i.d.  random  variables, 
independent  also  of  the  Poisson  process  on  L^,  taking  values  l,2,...,r  with 


probabilities 


P{Ji  =s)  = 


(  (t  ..-t  )/t  ,  s=l,2, . . . ,r-l, 

v  r-s+1  r-s"  r  '  '  '  '  ' 

\  r-j/t.  ,  s=r, 


mi 


■'T'VV  -  V 


i.e.  P{  3  •  >  s}=  t  ./t  for  s=l,2,...,r.  For  each  j,  place  points 

J  I”bl  J.  L. 

<T 2 j » <r3j '  •  •  • '  j  on  the  fl j ~l  lines  >  1^._2 '  * "  ’ ' II.  +1  f  vertically 

above  cr^,  to  complete  the  point  process  N.  Clearly  the  probability  that  a 
point  appears  on  I4r_1  above  <r^j  is  just  P{ /i j >  2 }=t r_3_/T r  and  the  deletions  are 
independent,  so  that  N^r  ^  is  obtained  on  Lr_1  as  an  independent  thinning  of 
the  Poisson  process  .  Hence  N^r  ^  is  a  Poisson  process  with  intensity 
r r(Tr_]_/Tr)=rr_i  as  expected.  Similarly  is  obtained  on  1^  as  an 

independent  thinning  of  with  deletion  probability  1-t^Tj^,  all 

being  Poisson.  The  main  theorem  is  as  follows. 


Theorem  1.3.2.  The  point  processes  Nn  defined  in  the  plane  as  above  (with 
levels  satisfying  (1.3.2))  converge  in  distribution  as  m a  to  the  point 
process  N  consisting  of  r  successively  thinned  Poisson  processes. 

Again  the  proof  of  this  may  be  accomplished  by  showing  that 
(Nn(Bi) , . . .  ^(B^.) )  ^  (N(B1) , . . .  ,N(Bk) )  for  each  choice  of  k  and  rectangles 
Bi , . . . ,B^,  the  calculations  being  more  involved  but  similar  to  the  one¬ 
dimensional  case. 


nti 
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Theorem  1.3.2  may  be  used  to  give  the  asymptotic  joint  distribution  of 
extreme  order  statistics  (and  their  locations) .  For  example  the  following 
result  concerns  the  maximum  and  second  largest  values . 

Theorem  1.3.3  Suppose  that  the  maximum  MJi(=M^)  has  the  asymptotic  distribution  G, 

(1.3.3)  P{a  (M^-b  )  <  x)  ->  G(x)  as  m ® . 

n  n  n 


Vv) 


•>  V 


Then  for  x^  >  x2, 

d.3.4)  P(an(<1)-b„)  5  xr  an(Mn(2»-bn)  <  x2)  4 

G(x2)  (log  G(x1)  -  log  G^)  +  1)  as  m®. 


This  may  be  proved  by  writing  =  -log  G(x^),  un(Tr)  =  xi/an+bn' 


•  **  w*1 


i=l,2  and  noting  that  (1.3.3)  implies  (using  Theorem  1.2.5)  that 
n(l-F(un(r^) ) )  -»t.  i=l,2.  The  left  hand  side  of  (1.3.4)  is  then  just  (writing 
N^k)  for  ((0,1])), 

P(Mn1}'  un(Tl}'  ^n2)  5  un(T2)}  =  p<Nn2)=0>  +  N^-l) 

which  converges  to  the  same  probabilities  with  replacing  N^.  This  is 

-7  -T-  n 

readily  seen  to  be  e  +  (T2-Tl^e  which  is  just  the  right  hand  side  of 
(1.3.4) . 

The  "r-level"  result  Theorem  1.3.2  allows  the  joint  asymptotic 
distribution  of  r  extreme  order  statistics  to  be  obtained.  On  the  other  hand 
these  results  may  be  summarized  in  one  theorem  commonly  referred  to  as  a 
"complete"  convergence  result,  and  which  concerns  convergence  of  the  process 
values  themselves  (suitably  normalized)  regarded  as  a  point  process  in  the 
plane.  This  result  is  intuitively  satisfying  and  in  the  i.i.d.  case  it  may  be 
regarded  as  the  fundamental  result  yielding  all  the  relevant  asymptotic 
distributional  properties.  On  the  other  hand  when  dependence  is  introduced 
into  a  sequence  the  "particil"  r-level  results  require  somewhat  less 
assumptions  than  does  the  "complete"  result. 

Theorem  1.3.4  Let  be  i.i.d.  with  marginal  d.f.  F.  Suppose  that 

u^C? ),  satisfying  (1.3.2)  for  each  t> 0,  is  continuous  and  strictly  decreasing 

in  7  for  each  n.  Let  denote  the  point  process  in  the  plane  consisting  of 

the  points  (j/n,  u1( ?  . ) )  where  u  1  denotes  the  inverse  function  of  u  (7), 
n  j  n  n 

defined  on  the  range  of  the  r.v. ’s  {£ ^) .  Then  N*  3  N*,  where  N*  is  a 
Poisson  process  on  (0,co)x(0,o)  having  Lebesgue  measure  m  as  its  intensity. 

Proof.  Again  this  is  readily  proved  from  convergence  of  finite  dimensional 


distributions.  For  a  rectangle  B=(a,b]x[c,d) ,  Nn(B)  is  clearly  binomial  with 

parameters  ( [nb]-[na]  ,pn)  where  Pn=P{c  <  <  d}  =  F(un(c) )  -F(un(d) 

★ 

(d-c)/n,  so  that  ([nb]-[na])p  -»  (b-a) (d-c)  =  m(B) .  Hence  Nn(B)  converges 

in  distribution  to  a  Poisson  r.v.  with  mean  m(B) .  If  B^=(a^,b^]xU  [c^  ,d^) 

where  the  intervals  (all]  are  disjoint,  it  follows  along  similar  lines  that 

(N*(Bi) . .  ) )  £  (N*(B1)...N*(B}c))).  It  is  readily  seen  from  this  that 

the  same  holds  when  B^  are  arbitrary  rectangles  of  the  form  (a^,b^]x[c^,d^)  which 

is  sufficient  (cf.  Sec.  2.3)  to  show  that  N  “  N  . 

n 

As  noted  above  this  theorem  summarizes  a  whole  spectrum  of  asymptotic 
distributional  results  for  maxima  and  extreme  order  statistics  in  the  i.i.d. 
case.  For  example  P{Nn((0,l]x[0,r))  <  r}  =  P{M^  <  un(T)}  93 
checked  so  that  the  limiting  distribution  of  can  be  obtained.  When 
has  the  asymptotic  distribution  G  as  in  (1.2.1)  we  may  take  u^r)  =  e^1G  1(e-T)+b 
as  is  readily  checked.  In  that  case  Theorem  1.3.4  may  be  readily  transformed  to 
give  the  following  form  (writing  xQ  =  inf{x:  G(x)  >  0}). 

Theorem  1,3.5  Suppose  (1.2.1)  holds,  for  the  i.i.d.  sequence  {£j},  and  let 

I 

Nn  be  the  point  process  in  the  plane  with  points  at  (j/n,  an(£  j“bn)) .  Then 

'  '  i 

N  *♦  N  on  (0,®)  x  (Xq, oo )  where  N  is  a  Poisson  process  whose  intensity 
measure  is  the  product  of  Lebesgue  measure  and  that  defined  by  the  increas¬ 
ing  function  log  G(y) . 

This  form  of  the  result  was  first  proved  by  Pickards  ([71])  and  is  more 
transparent  when  linear  normalizations  give  an  asymptotic  distribution  for  Hn- 
(Theorem  1.3.4  applies  to  linear  or  nonlinear  normalizations).  For  example  it 
is  clear  that  P{an(M^r^ -b  )  S  x)  =  P(N  ((0,1]  x  (x,a>))  <  r-1}  -> 

I 

P{N  ((0,1]  x  (x,®))  <  r-1}  from  which  (1.2.9)  follows  simply. 


2.  Extremes  of  sequences 

2.1  Hie  Extremal  Types  Theorem  for  stationary  sequences. 

In  this  section  it  will  be  shewn  that  the  Extremal  Types  Theorem  still 
holds  for  (strictly)  stationary  sequences  under  weak  dependence  assumptions. 
Obviously  some  form  of  restriction  on  the  dependence  structure  of  the  sequence 
is  necessary  to  obtain  nontrivial  results  since  e.g.  one  might  take  all  £  ^  to 
be  equal  with  arbitrary  d. f . ,  so  that  would  also  have  this  assigned  d.f. 

Then  in  the  next  section  we  shall  see  that  the  introduction  of  dependence  does 
not  typically  alter  the  limiting  distributional  type  for  the  maximum  and  will 
explore  the  precise  changes  involved. 

Loynes  ([61])  first  obtained  a  form  of  the  Extremal  Types  Theorem 
under  dependence  -  assuming  strong  mixing.  Weaker  (distributional)  conditions 
will  suffice  and  will  be  used  here.  The  difference  is  not  too  important  for 
our  present  purposes  since  the  main  ideas  of  proof  are  essentially  the  same. 
The  main  condition  to  be  used  (termed  D(un) )  is  defined  with  reference  to  a 
sequence  (un)  of  constants  in  terms  of  the  finite  dimensional  d.f. 's 


F.  .  (x  , ...x  )  =  P{£.  <  x  ,...£.  <  x  }  of  the  stationary  sequence  {£  }. 

J-  •  •  •  X  X  il  -1—  X  X  ii  *  * 

in  CL  n 

Writing  F.  .  (u)  =  F.  .  (u,u,...u),  define 
X1 ' ’ * ’ 1n  1l'*,*1n 

«n  1  =  i  i  -i  (UJ  -  Fi  i  (UJ  Fi  i  (un)!  : 


(un) I  = 


15  ii  <  h  <  AP  <  ji-**  <  V  -  n'  ji_ip  2  1} 

Then  D(u  )  is  said  to  hold  if  a  ,  ->  0  for  some  sequence  1  =  o(n)  . 
n  n,l  ^  n 

It  is,  incidentally,  obviously  possible  to  weaken  the  condition  D(un) 
very  slightly  to  involve  "intervals"  of  consecutive  integers  (See  O'Brien 
([68])  for  the  details  of  such  a  procedure  and  for  some  advantages  in 


application  to  periodic  Markov  chains.) 


The  following  result  is  basic  for  the  discussion  of  arri  shows  the 
form  in  which  D(un)  entails  approximate  independence.  It  is  stated  in  a 
somewhat  more  general  form  than  needed  in  this  section. 


Lemma  2.1.1.  Let  {u^}  be  a  sequence  of  constants  and  let  D(u^)  be  satisfied 
by  the  stationary  sequence  { £n) .  Let  {k^  >  1)  be  constants  such  that  1^=0 (n)  and 
(in  the  notation  used  above  for  DCu^)),  k^l^Cn),  k^a^  j  ->  0.  Then 

k 

P{M^  <  un)  -  P  “{M^  <  un)  -»  0  as  mm, 

n 

where  rn  =  [n/k^] . 

The  proof  of  this  result  is  perhaps  the  key  method  in  dependent 
extremal  theory.  The  type  of  argument  was  used  first  in  this  context  by 
Loynes  ([61])  but  was  used  earlier  in  dependent  central  limit  theory  (cf. 

[15]). 


The  basic  idea  is  to  divide  the  integers  l,2,...,n  into  "intervals" 

II'  h'  V  *2  **#Ik  '  Xk  '  where  I1  =  {1'2’'-rn~1n) '  1lm^rT1n+1,,"rn) 
I2=(rn+1'  •  •  *2rn-In)  and  so  on  in  this  fashion  except  for  the  last  interval 

•k 

Ij.  =  (K^rn-In+l, . .  ,n) .  Thus  1^,1  ...I.  are  large  intervals  separated  by 
smaller  (but  typically  expanding)  intervals  I  ,1  . .. 

(cf.  ([54])  for  details)  are 


The  steps  of  proof 


*h 


(i)  Approximate  P{Mn<un)  by  P(_n  (M(I..)<un)}  (using  M(E)  to  denote 


max  {^:  j«E}).  This  simply  reflects  the  fact  that  the  maximum 


is  likely  to  occur  on  the  larger  intervals. 


(ii)  Approximate  P{  n  (M(I^)  <  un) }  by  P  n(M(I1)  <  un)  using  D(un) . 
k.  k 

(iii)  Approximate  P  (M(I1)sun)  by  P  -un)/  the  maximum  of  the  first 

n 

r  of  the  £ , *s  being  likely  to  occur  on  I. . 


The  Extremal  Types  Theorem  new  follows  simply  from  this  result. 


Theorem  2.1.2.  (Extremal  Types  Theorem  for  Stationary  Sequences)  Let  (£n) 
be  a  stationary  sequence  such  that  Mh=max(£ ?2, . . . ,  fn)  has  a  non-degenerate 
limiting  distribution  G  as  in  (1.2.1).  Suppose  that  D^)  holds  for  each  un 
of  the  form  un=%/an+bn,  for  x  with  0  <  G(x)  <  1.  Then  G  is  one  of  the  three 
classical  extremal  types. 

Proof.  Writing  un=x/an+bn  it  follows  that  -*  G(x)  and  D(u^)  holds  (at 

continuity  points  of  G) .  By  Lemma  2.1.1  by  putting  kn=k,  fixed,  and  then 
replacing  n  by  nk  we  have  “  pk^Mn£un}c^0  50  that  PlM^u-J+G^tx) 

or  P<ank<Vbnk>  - x)  ->  G1//^(x) ,  Jc=l,2,...  .  But  this  implies  that  G  is  max 
stable  by  Lemma  1.2.3  and  hence  an  extreme  value  d.f. 

2.2  The  Extremal  Index 

While  the  introduction  of  dependence  into  a  sequence  can  significantly 
affect  various  extremal  properties,  it  does  not,  within  broad  limits,  affect 
the  distributional  type  for  the  maximum.  The  purpose  of  this  section  is  to 
make  that  rough  statement  precise  and  to  explore  the  explicit  changes  brought 
by  a  dependence  structure.  This  depends  essentially  on  a  single  parameter 
sometimes  called  the  "extremal  index"  of  the  (stationary)  sequence  (£n). 

Following  Loynes  ([61])  it  will  be  convenient,  for  a  given  stationary 

A 

sequence  (£n),  to  define  the  associated  independent  sequence  { f  „ }  to  be 
i.i.d.  with  the  same  d.f.  F  as  and  to  write  Mh=max(i y  £ 2, . . .£n) ,  with 
Mn=max($  1,  l2, . . .  $  )  as  before.  As  noted  originally  for  strongly  mixing 
sequences  in  [61],  if  un  =un(r)  satisfies  (1.2.4)  for  each  t,  i.e. 

(2.2.1)  n[l  -  F(un(r))]  -»  t 

then  any  limit  (function)  for  P(Mn  -  1^(7))  must  be  of  the  form  e- 9  7  with 


fixed  e  t  [0/1]  rather  than  just  the  function  e_T  given  by  (1.2.3)  in  the 
i.i.d.  case. 

If  P{^  <  un(r)}->  e-0T  for  each  t>0,  with  un(r)  satisfying  (2.2.1), 
we  say  that  the  stationary  sequence  {{  }  has  extremal  index  0  (iO).  This 
definition  does  not  involve  any  dependence  restriction  on  the  sequence  { t n) . 
The  following  result  shows  that,  under  D(u^)  conditions  any  limit  for 
P(M  <u  (t) }  must  be  of  this  form. 

Lemma  2.2.1.  For  the  stationary  sequence  (£n)  and  constants  (un(r)} 

satisfying  (2.2.1)  suppose  that  0(^(7))  holds  for  each  r>0.  Then 

there  exist  constants  9,  8',O<0<0'£1  such  that  lim  sup  P(Mnsun(r))= 

, 

e~0T,  lim  inf  P(r^<  un(7)}  =  e'0T  for  T'  50  that  if  -  un(T)) 

rn  <a  , 

converges  for  some  r>0  then  0  =0  and  -  un(T))  •*  e-0T  for  all  r>0  and 

(t  )  has  extremal  index  0,  0  <  0  <  1. 

This  result  is  proved  by  using  Lemma  2.1.1  to  show  that 
vt(T)=lim  sup  P{l^<un(T)}  satisfies  ^(7/k)  =  for  each  k=l,2,...,  to 

give  the  exponential  limit.  The  details  of  this  proof  may  be  found  in  [54], 
Note  that  it  follows  in  the  course  of  the  proof  that  9<1. 

Clearly  for  any  i.i.d.  sequence  for  which  (1.2.6)  holds  (so  that 
un(T)  may  be  found  to  satisfy  (2.2.1))  has  extremal  index  0=1.  A  stationary 
sequence  (£n)  satisfying  D(un(r))  for  each  t>0  also  has  extremal  index  0=1  if 

[nA] 

(2.2.2)  lim  sup  n  l  P{  ?  >  u  ,  f  .  >  u  )  ■*  0  as  k-  «. 

n-»03  j=2  3 

For  proof  see  [55]  where  (2.2.2)  is  referred  to  as  D'(un)). 

Many  stationary  sequences  satisfy  (2.2.2),  including  normal  sequences 
with  covariance  sequence  (rn)  satisfying  the  "Berman  Condition"  rn  log  n  ->  0. 
Sufficient  conditions  for  values  of  0<1  are  given  in  [54],  and  an  example 


with  0=1/2  appears  later  in  this  section.  Examples  can  be  found  where  the 
extremal  index  is  zero,  or  does  not  even  exist.  This  obviously  has  some 
theoretical  interest  but  appears  to  occur  in  somewhat  pathological  cases  and 
will  not  be  pursued  in  the  present  discussion. 

The  usefulness  of  the  extremal  index  appears  from  the  following 

result. 

Theorem  2.2.2  Suppose  that  the  stationary  sequence  {£  )  has  extremal  index 
0>O.  Let  {vn}  be  any  sequence  of  constants  and  p  any  constant  with  0<p<l. 
Then  ^  vn)  -ip  if  and  only  if  P{M^  <  vn)  -*  p9. 

This  result  makes  no  assumption  about  dependence,  and  is  readily  shown 
by  obvious  arguments  (cf.  [54]  for  details). 

The  following  result  now  follows  as  a  corollary,  by  taking  vn=3^an+bn 
in  the  theorem. 

Theorem  2.2.3  Let  the  stationary  sequence  { £n>  have  extremal  index  0>O.  If 
P^an^h~bn^  -  x)  G(x)  p^an^Vfbn^  -  ^  G9  (x)  and  conversely.  That 

A 

is  Mn  has  an  asymptotic  distribution  if  and  only  if  does,  with  the  power 
relation  between  the  limits  and  the  same  normalizing  constants. 

By  way  of  comment,  note  that  G9  is  of  the  same  type  as  G  if  one 
of  them  is  of  extreme  value  type  (e.g.  [exp(-e  x)  ] 9  =  exp[-e  '1'0^  ^],  and 

similarly  for  type  II  and  III) .  If  9=1  the  limits  for  and  are 
precisely  the  same.  Indeed  for  0<9<1  the  limits  may  also  be  taken  to  be  the 
same  by  a  simple  change  of  normalizing  constants. 

The  practical  implication  of  this  result  is  that  one  often  does  not 
need  to  be  concerned  about  possible  dependence  in  the  data  when  applying 
classical  extreme  value  theory.  Indeed  one  may  not  have  to  worry  about  the 


precise  value  of  the  extremal  index  since  this  only  alters  parameters  of  the 
distribution  which  usually  must  be  estimated  in  any  case.  Further,  if  6>0, 
the  fact  that  the  distributional  type  under  dependence  is  the  same  as  under 
independence  means  that  the  classical  domain  of  attraction  criteria  may  be 
applied  to  the  marginal  d.f.  of  the  terms  to  determine  which  type  applies. 

The  following  simple  example  provides  a  case  where  6<1,  and  will  also 
be  useful  later  when  the  effects  of  the  value  of  0  on  the  clustering  of 
exceedances  will  be  discussed. 


Example  2.2.4  Let  r,  ,r,  . . .  be  i.i.d.  with  d.f.  H  and  write  £  .=max(*)  . ,  r, .  ) . 

±  *  3  j 

Then  (£n)  is  stationary  with  d.f.  F  =  H2  and  an  easy  calculation  shows  that 
if  un(r)  satisfies  (2.2.1)  then  n[l-H(u  (t) )  ]  a  r/2  and 

P{Mh  iun(T))  =  P{max  (V’V  £  VT>>  P{T,n+l  '  un(7))  4  e"T/2 
so  that  (£n)  has  extremal  index  0=1/2. 


Criteria  for  determining  the  extremal  index  are  discussed  in  [  54  ] . 
Finally,  we  note  that  an  interesting  approach  to  the  relating  of  dependent  and 
i.i.d.  cases  has  been  given  recently  by  O'Brien  [68].  This  is  based  on  the 
general  result 


p{Mn  <  un}  -  F(un) 


nP{max  (f  2,S3,  •  •  •  Sn)  <  |  ^  >  un) 


-»  0 


which  is  shown  in  [68]  to  hold  under  weak  dependence  conditions,  for  a  wide 
variety  of  sequences  (un)  and  integers  pn  ->  co  with  pn=o(n) . 


2.3  Relevant  point  process  concepts. 

In  dealing  with  dependent  cases  it  will  be  necessary  to  be  somewhat 


more  formal  than  previously  in  the  use  of  point  process  methods.  Here  we 
establish  the  notation  and  framework  (substantially  following  Kallenberg 
([53])) ,  and  review  a  few  key  concepts  which  will  be  needed. 


In  general  a  point  process  is  often  defined  on  a  locally  compact 
second  countable  (hence  complete  separable  metric)  space  S,  though  here  S  will 
invariably  be  a  subset  of  the  line  or  plane.  Write  S  for  the  class  of  Borel 
sets  on  S  and  B=3  (S)  for  the  bounded  (i.e.  relatively  compact)  sets  in  S.  A 
point  process  $  on  S  is  a  random  element  in  M=M(S) ,  the  space  of  locally 
finite  (i.e.  finite  on  B(S))  integer-valued  measures  on  5  where  M  has  the 
vague  topology  and  Borel  <r -field  M  =  M( S)  . 

Write  F  =  F(S)  for  the  class  of  non-negative  S -measurable  functions, 
uf  =  Ifdu  for  u  (  M,  f  e  F(S ).  The  distribution  P£  1  of  a  point  process 
f  is  uniquely  determined  by  the  distributions  of  (E(I  )  ...  £  (I  ) ) , 

-L  K 

k=l ,  2  .  .  .  ,Ij  £  T  if  T  is  any  semiring  whose  generated  ring  is  B.  The 
distribution  of  £  is  also  uniquely  determined  by  the  Laplace  Transform 

-Sf 

Lj  (f)  =  Ee  ,  f  (F. 

A  (general)  Poisson  Process  with  intensity  measure  X  has  the  Laplace 
Transform  L£ (f)=exp(-x (1-e  f) }  whereas  a  Compound  Poisson  Process  has  Laplace 
Transform 

(2.3.1)  Lt  (f)  =  exp  {-X (l-Lflof) } 

'•••here  6  is  a  positive  integer-valued  random  variable  with  Laplace  Transform 
Ljj(t )=Ee  This  consists  of  multiple  events  of  (independent)  sizes  R 

located  at  the  points  of  a  Poisson  Process  having  intensity  measure  x  . 

Convergence  in  distribution  of  a  sequence  { £n)  of  point  processes  to  a 
point  process  £  is,  of  course,  sinply  weak  convergence  of  P^1  to  P£  1. 

It  may  be  shewn  (cf.  [53])  that  £^  ->  £  if  and  only  if  Lj  (f)  -  Lt  (f)  for  ever 
f  (  F  ,  the  subclass  of  F  consisting  of  the  nonnegative  continuous  functions 
with  compact  support.  Point  process  convergence  is  also  equivalent  to  con¬ 
vergence  of  finite  dimensional  distributions.  Even  more  simply  £  -  $  if  and 

only  if  (£  (I  )  ...  £  (I  ))  -  (((I,)  ...  £  (Iv))  k=l,2...  I.  £  T  where  T  ;  3 


is  a  semiring  such  that  £  (3B)=0  a.s.  for  each  BeT,  and  such  that  for  any 
B  (  B,  t  >  0,  B  may  be  covered  by  finitely  many  sets  of  T  with  diameter  less 
than  €  (cf.  [53  Theorem  4.2]).  The  semiclosed  intervals  and  rectangles  used 
in  Section  1.3  form  such  classes  and  hence  e.g.  (1.3.1)  is  indeed  equivalent 
to  full  convergence  in  distribution  of  to  N. 

Finally  the  dependent  counterpart  of  Theorem  1.3.4  requires  the 
concept  of  infinite  divisibility.  A  point  process  £  is  said  to  be  infinitely 
divisible  if  for  each  n=l,2,...  there  exist  some  independent  and  identically 
distributed  point  processes  £1,.../£n  such  that  £  =  £^  +  £2  •••  +  £n-  The 
Laplace  Transform  of  an  infinitely  divisible  point  process  has  the  canonical 
representation 

(2.3.2)  -  log  L t ( f )  =  I  (l-e-“f)P(d u) 

M\{0} 

where  P  is  a  measure  on  M\{0)  such  that  f  ( l-e~ « ( B > ) P (du)  <  « 

M\{0) 

for  all  B  <  s.  P  is  referred  to  as  the  canonical  measure  of  £. 

2.4  Convergence  of  point  processes  associated  with  extremes 

We  return  now  to  the  stationary  sequence  (£n)  and  consider  point 
process  convergence  results  along  the  same  lines  as  for  the  i.i.d.  case  in 
Section  1.3.  The  notation  of  that  and  other  previous  sections  will  be  used. 
In  particular  N  will  denote  the  point  process  of  exceedances  on  (0,1]  as 
defined  in  Section  1.3,  viz  Nn(E)=#(i/n  e  E:  £  ^  >  u  ,  lsisn),  for  a  given 
sequence  of  constants  u  . 

When  (£n)  has  extremal  index  6=1,  the  Poisson  convergence  result, 
Theorem  1.3.1,  may  be  proved  provided  D(u  )  holds.  This  leads  again  to  the 
classical  form  (1.2.9)  for  the  asymptotic  distributions  of  extreme  order 
statistics.  Similarly  Theorem  1.3.2  holds  under  an  r-level  version  Dr(un)  of 


D(un)  (cf.  [55,  p.  107])  leading  to  the  classical  forms  for  the  asymptotic 
joint  distributions  of  extreme  order  statistics  when  6=1  (cf.  Theorem  1.3.3). 

The  "complete  convergence"  result  Theorem  1.3.4  also  holds  giving  again  a 
Poisson  limit  in  the  plane  when  6=1  provided  the  multilevel  conditions  Dr(un) 
all  hold.  These  results  are  described  in  [55] ;  here  we  indicate  the  new 
features  which  arise  when  0<6<1. 

As  noted  in  Section  2.2,  cases  when  8<1  occur  when  there  is  "high 
local  dependence"  in  the  sequence  so  that  one  exceedance  is  likely  to  be 
followed  by  others  (see  Example  2.2.4  as  an  illustration  of  this).  The  result 
is  a  clustering  of  exceedances,  leading  to  a  compounding  of  events  in  the 
limiting  point  process. 

To  include  cases  where  such  clustering  occurs  (i.e.  0<8<1)  we 
require  the  following  modest  strengthening  of  the  D(un)  condition  (cf.[50]). 

Let  B^(un)  be  the  <r -field  generated  by  the  events  { £s  <  un),  i<s<j. 

For  l<l<n-l  write 

(2.4.1)  J3n^  =  max{  |  P(A  n  B)-P(A)P(B)|  :  AeB*(un),  BeB^+I  (un)  ,  1  <  k  <  n-2  } 

Then  the  condition  a  (u  )  is  said  to  hold  if  li  ,  ->  0  for  some  sequence 

n  n'^ 

ln  with  2n=o(n) .  (Bn  will  be  called  the  mixing  coefficients  for 
a.  The  condition  a  is  of  course  stronger  than  D  but  still  significantly 
weaker  than  strong  mixing. 

The  condition  a  will  be  applied  through  the  following  lemma  which 


is  a  special  case  of  [84,  Equation  I'). 

Lerrv:  2.4.1.  For  each  n  and  lsl5n-l  write  Y 


sup  Ei)  f  -Ei i  Ei  1 


where  the  supremum  is  taken  over  all  rj ,  {  measurable  with  respect  B1  (t^) , 

( un)  respectively,  0  <  -n  <1,  15  j<  n-I ) .  Then  j  -  7  n  j  -  4^n  2 

where  3^  ^  is  the  mixing  coefficient  for  a,  given  by  (2.4.1).  In 


’  .  -  T>  V V  V  V  V  V  V  V  V  V  V  V  V/«' AV,  A  A.  .V  ’  .  •  ,  * 


particular  {£  }  satisfies  a  (u  )  if  and  only  if  y  ,  -»  0  for  some  1  =o(n) . 

I  >  i  1  1 1 1  j.  1 1 

'  n 

It  will  be  convenient  to  have  the  following  simple  notion  of  clusters. 

Divide  the  into  successive  groups  ),  (f  , . . . ,  £2r  )  •••of 

n  n  n 

rn  consecutive  terms  where  rn  (=o(n))  is  appropriately  chosen.  Then  all 
exceedances  of  u^  within  a  group  are  regarded  as  forming  a  cluster.  Note  that 
since  rn=o(n)  the  positions  of  the  members  of  a  single  cluster  will  coalesce 
after  the  time  normalization,  giving  nearly  multiple  points  in  the  point 
process  Nn  on  (0,1].  The  following  lemma  shews  that  the  clusters  are 
asymptotically  independent. 

Lemma  2.4.2.  Let  r>0  be  constant  and  let  a  (u^)  hold  with  un=un(T)  satisfying 

(2.2.1).  Suppose  {k^}  is  a  sequence  of  integers  for  which  there  exists  a 

sequence  (1^)  such  that  k^l ^/n  ->  0  and  k^Bn  j  -*  0,  where  B^  ^  is  the 

'  n  ' 

mixing  coefficient  of  a (u^) .  Then,  for  each  non-negative  continuous  f  on 


(0,1], 


(2.4.2) 


n  K,  irn 

E  exp(-I  f  (j/n)  X  .)  -  “n  E  exp(-:  n 


j=(i-l)r+l 


f  (j/n)\, 


where  X  .  is  the  indicator  l,t  .  and  r  =[n/X  ]. 
n,j  { L  >  u  ^  n  L  v  nJ 

This  result  is  proved  by  the  standard  basic  technique.  Here  the 

"intervals"  (l,...,r  ),  (r,,...,2r  )  ...  are  each  shortened  by  omitting  the 

n  n+i  n 

final  1  integers  of  each,  and  successive  approximations  made  for  the  first 
term  of  (2.4.2)  in  a  similar  way  to  the  argument  of  Lemma  2.1.1,  VaS  ir.q 
Lemma  2.4.1.  (See  [50]  for  details). 

The  number  of  exceedances  in  the  i^1  cluster  is  N  ( (i-l)r^/n,  ir  ■/r.] ) 
ir 

=  I  x  and  the  cluster  size  distribution  is  therefore 

j=(i-l)r+l  n'^ 

conveniently  defined  to  be  given  by 


V.V  v  -.-V  •  VV-J.V'.- 


The  following  result  gives  sufficient  conditions  for  to  have  a  Compound 
Poisson  limit. 

Theorem  2.4.3.  Let  the  stationary  sequence  { have  extremal  index  0>O,  and 
suppose  that  the  conditions  of  Lemma  2.4.2  hold.  If  7in(i)  (defined  by 
(2.4.3))  has  a  limit  tt  (i)  for  each  1=1,2,...,  then  m  is  a  probability 
distribution  on  1,2,...  and  the  exceedance  point  process  converges  in 
distribution  to  a  Compound  Poisson  Process  N  with  Laplace  transform 

1  CO 

(2.4.4)  L-.(  f )  =  exp  (-6  t  ;  (1-  £  7T  ( i ) )  dt) 

”  0  i.=l 

Proof.  The  Laplace  Transform  L^  (f)  of  Nn  is  precisely  the  first  term  of 
(2.4.2)  and  hence  may  be  approximated  by  the  second  term.  This  latter  term 
may  be  manipulated  by  using  the  facts  that  for  large  n,  f(j/n)  is 

ir 

approximately  constant  in  (i-l)rn  <  j  <  irn,  and  Zn  =  E  has  the 

distribution  P{Zn=i)  =  l-pn  or  Pn7Tn(i)  according  as  i=0  or  i>u,  where 
Pn=P{Nn(  (0,rn] )  >  0}  -  Gr/k^  from  Lemma  2.1.1. 

The  Laplace  Transform  (2.4.4)  is  of  the  form  (2.3.1)  with  the  integer 
valued  r.v.  /3  satisfying  P{6=i)  =  m(i)  and  intensity  measure  simply  0Tm  where 
m  is  Lebesgue  measure.  That  is  N  consists  of  multiple  events  of  size  whose 
distribution  is  7T(i),  located  at  the  points  of  a  Poisson  Process  having 
intensity  9  r  . 

The  following  result,  shewing  that  the  Compound  Poisson  Process  is  the 
only  possible  limit  for  N  under  the  conditions  a  is  proved  along  similar 
lines  to  Theorem  2.4.3.  (Full  details  may  be  found  in  [50]). 

Theorem  2.4.4.  Suppose  t>0  is  constant  and  the  condition  a  (u  )  holds 
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(^=1^(7)  satisfying  (2.2.1))  for  the  stationary  sequence  {{j}.  If  Nr 
converges  in  distribution  to  same  point  process  N,  then  the  limit  must  be  a 
Compound  Poisson  Process  with  Laplace  Transform  (2.4.4)  where  m  is  same 
probability  measure  on  {1,2,...}  and  0  =  -7  1  log  lim  P{^  <  un(r)}  e  [0,1]. 

If  0^0  then  7T { i >  =  lim  mn(i}  where  is  defined  by  (2.4.3)  for  r^n/k^], 
k^co)  being  any  sequence  chosen  as  in  Lemma  2.4.2. 

Example  2.4.5.  (Example  2.2.4  continued)  It  is  evident  that  the  exceedances 
of  un  by  the  process  £ j=*max(t) j , in  Example  2.2.4  occur  in  (at  least) 
pairs,  since  if  <  un  but  >  u^  then  t)j+1  >  un  and  hence  f ^+1  >  u^.  It 

is  readily  seen  by  direct  evaluation  that  7in(2)  ->  1  and  hence  m(i)=l  or  0 
according  as  i=2  or  ij*2.  Thus  the  limiting  point  process  N  consists  entirely 
of  double  events  and  (2.4.4)  gives 

Ifc(f)  =  exp{-(7/2);01  (l-e~2f(t))dt}. 


The  most  important  application  of  the  Compound  Poisson  limit  is  to 
give  the  asymptotic  distribution  of  the  Xth  largest  value  of 
when  0<1,  using  the  relationship 


(2.4.5) 


P{Mhk)  -  VT))  =  P{Nn((0'1])  5  k_1} 


Theorem  2.4.6.  Surroose  that  for  each  7>0.  a  (u  )  holds  with  u  =u  (7) 

-  -  ^  n  n  n 

satisfying  (2.2.1)  and  that  Nn(=Nn  )  converges  in  distribution  to  some  non¬ 


trivial  point  process  N  (=N  )  (which  will  occur  e.g.  if  the  conditions  of 

Theorem  2.4.3  hold) .  Assume  that  the  maximum  has  the  non-degenerate 

asymptotic  distribution  G  as  given  in  (1.2.1).  Then  for  each  k=l,2,... 

fkl  i  *i  . 

(2.4.6)  P{a  (Ml*'-b  )  <  x)  -»  G(x)  [1+  I  l  ( (-log  G(x) )  J/D ! )  *  J  U)  ] 

n  n  n  j=l  i=l 


; v>.*.  -U/.  ^ 


(zero  if  G(x)=0),  where  7T ^  is  the  j-fold  convolution  of  the  probability 
tv  =  lim  Tvn,  7in  being  given  as  in  Theorem  2.4.4. 

Proof.  It  follows  from  Theorem  2.4.4  that  (£n)  has  extremal  index  9>0  and 

by  Theorems  2.2.3  and  1.2.5  that  u  (r)  =  a  ^x+b  satisfies  (2.2.1)  with 

n  n  n 

r  =  t  (x)  =  -log  G1  / ®  (x) .  The  result  follows  using  (2.4.5)  since 

k-1  .  k-1  . 

P{N  ((0,1])  <  k-l)  4  P{N((0,1])  <  k-1)  =e-eT{i+  z  (0T)^  z  n  D (i)/j ! 

j=l  i=j 

which  equals  the  right  hand  side  of  (2.4.6). 

Note  that  the  form  (2.4.6)  differs  from  the  (classical)  case  0=1 
(i.e.  (1.2.9)),  by  inclusion  of  the  convolution  terms.  These  arise  since  e.g. 
the  second  largest  may  be  the  second  largest  in  the  cluster  where  the  maximum 
occurs  or  the  largest  in  some  other  cluster.  This  contrasts  with  the  case  k=l 
for  the  maximum  itself  involving  only  the  relatively  minor  change  (Theorem 
2.2.3)  of  replacing  the  classical  limit  by  its  0th  power. 

Finally  in  this  section  we  indicate  the  modifications  required  by  the 
dependence  structure  for  "complete"  convergence  results  such  as  Theorem  1.3.4. 
As  in  that  case  let  Nn  denote  the  point  process  in  the  plane  consisting  of 
points  at  (j/n,  u^1 ( £  ^ ) )  where  u^1  denotes  the  inverse  function  of  u^ ( r ) . 

Under  appropriate  conditions  (including  e.g.  that  (£n)  has  extremal 

•ff 

index  0=1)  Nn  has  again  a  Poisson  limit  N  in  (0,co)  x  (0,a>)  with  Lebesgue 
measure  as  its  intensity.  However,  as  for  the  exceedance  point  process,  the 
limit  may  undergo  "compounding"  when  9<1. 

The  possible  limiting  forms  for  Nn  were  discussed  first  by  Mori  ([65]) 
under  strong  mixing  conditions.  More  recently  a  transparent  derivation  has 
been  given  by  Hsing  ([48])  under  weaker  conditions,  of  a (ufi)  type  but 
involving  multiple  levels  un(T^) •  A  derivation  similar  to  that  for  the 


29 


.  .  *  * 

exceedance  process  shews  that  any  limit  in  distribution  of  N^,  N  say,  must 
have  independent  increments,  be  infinitely  divisible  and  have  certain 
stationarity  properties.  These  properties  restrict  the  canonical  measure  P  of 
N*  to  a  form  which  can  be  readily  determined  (though  requiring  further 
notation) ,  thus  providing  a  specific  expression  for  the  Laplace  Transform  of 

■k  * 

N  .  Rather  more  illuminating,  however,  is  the  "cluster"  representation  of  N 

k 

which  exhibits  N  as  a  Poisson  Process  in  the  plane  together  with  a  countable 
family  of  points  with  integer  valued  masses  on  vertical  lines  above  and 
emanating  from  each  Poisson  point. 

CD 

Specifically  let  5.  .  .  denote  unit  mass  at  (s,t)  and  {  =  Z  5  T  . 

(s,t)  i=1  (bi'xi' 

a  homogeneous  Poisson  Process  on  (0,o)  x  (0,q).  Let  7^,  i=l,2,...,be  i.i.d. 
point  processes  on  [  1 , © )  each  independent  of  l ,  such  that  7^({1})>1,7^  having 
points  of  mass  a. .  at  Y. ..  Then  (under  the  assumed  conditions) , 

N*  a  :  -  aii  5  IS  T  Y  ' 
i  j  J  (Si,TiYij' 

■jf 

As  is  clear  from  this  representation  N  has  atoms  at  each  (S^,T^)  (since  the 

smallest  Yj^=l  f°r  each  i)  and  at  points  (S^,  TT«L  j)  lying  vertically  above 

(S.,T.) . 
v  l  r 

As  noted  in  Sec.  1.3,  theorems  of  this  type  summarize  the  relevant 
information  concerning  asymptotic  joint  distributions  of  extreme  order 
statistics,  in  contrast  to  the  individual  marginal  distributions  obtained  in 
Theorem  2.4.6. 


2.5  Normal  sequences:  the  comparison  method. 

For  stationary  normal  sequences  with  covariances  (rn),  the  condition 
L(un)  holds  -  as  also  does  the  sufficient  condition  (2.2.2)  for  the  extremal 
index  to  be  1  provided  the  "Berman  Condition"  holds,  viz. 


•-  t.V*.  *•  *.  % 


0 


(2.5.1) 


r  log  n  -»  0 
n 


as  n  -»  ©. 


These  results  are  simply  proved  by  means  of  a  widely  used  comparison 


method  which,  in  particular,  bounds  the  difference  between  two  (standardized) 
normal  d.f. 's  by  a  convenient  function  of  their  covariances .  This  result  - 
here  given  in  a  general  form  -  has  been  developed  in  various  ways  by  Slepian 

I 

[81],  Berman  [9]  and  Cramer  (cf.[21]). 

Theorem  2.5.1  (Normal  Comparison  Lemma) .  Suppose  that  £ . . . ,  {■  are  standard 
normal  random  variables  with  covariance  matrix  A1  =  (A^j )  and  T>n 

similarly,  with  covariance  matrix  (A0  =  A°  j) ,  and  let  p^=max(  |  A^.  j  ,  |  A0^  |  ) . 
Then,  for  any  real  numbers  u1,u2,...,u  , 

(2.5.2)  P{£j  <  Uj,  j=l,2, . . . ,n)  -  P{ nj  <  u.,  j=l,2,...,n)  < 

( 2  r ) — 1  Z  (.1.  -  A°)  +  (l-p?.f1/2  exp[-(u2.+u^)/ (2 (1+p  .  . ) )  ] 

l<i<j<n  1  J 

where  x+  =  max(x,0).  Further,  replacing  (A^  •  -  A0.  . )+  by  its  absolute  value 

J- J  XJ 

on  the  right  hand  side  of  (2.5.2)  yields  an  upper  bound  for  the  absolute  value 
of  the  difference  on  the  left  hand  side. 


By  taking  £1,£2...  to  be  a  stationary  sequence  of  standardized 
normal  r.v.'s  with  covariance  sequence  (r  }  and  *» 2 •••  ho  be  i.i.d. 
standard  normal  r.v. 's  it  follows  simply  from  the  theorem  that  if 
sup  !  rn!  <1  then  for  any  real  sequence  {u^} , 


(2.5.3) 


.  (u  )  - 
.1  v  n' 
s 


’(u  )|  5  Kn  I  |  r.  I  exp  iL/(l+i  r.  I  ) 
j=l  J  J 


where  F.  .  is  the  joint  (normal)  distribution  of  £  .  ,...,£.  and  ;  is 

Vs  h  ^ 

the  standard  normal  d.f.,  i  . ,.is  being  any  choice  of  distinct  integers  from 


1,2. . .n. 


Now  if  n(l-':(un))  is  bounded  and  (2.5.1)  holds  it  can  be  shewn  (by 

seme  routine  calculation)  that  the  right  hand  side  of  (2.5.3)  tends  to  zero 

as  rxo,  shewing  that  P{£ .  <  u  ,...  £  ■  s  un)  is  approximately  the  same  as 

\  n  ^s 

it  would  be  if  the  r.v. 's  were  i.i.d.  instead  of  being  correlated. 

One  can  clearly  (by  identifying  i. , . . .  i  with  1, . . .n)  then  show 

X  s 

directly  that  P{Mn  <  un)  is  approximately  the  same  as  for  the  i.i.d.  standard 
normal  sequence.  Or  Equation  (2.5.3)  may  be  simply  used  to  verify  the 
conditions  D(u  ) ,  (2.2.2)  and  Theorem  2.2.3  used,  thus  leading  by  either  route 
to  the  following  result. 

Theorem  2.5.2.  Let  (£n)  be  a  (standardized)  stationary  normal  sequence  with 

covariances  (r  }  such  that  r  log  n  a  oasm®.  Then 
n  n  ^ 

P{an(\-bn)  5  x)  *  e*P("e~X) 
where  a^,  bn  are  given  by  (1.2.5). 

Thus  if  rn  log  n  ->  0,  the  maximum  from  the  stationary  normal 

sequence  has  precisely  the  same  asymptotic  distribution  as  an  i.i.d.  normal 

sequence.  The  same  is  true  of  the  distributions  of  all  extreme  order 

statistics.  Although  a  slight  weakening  of  (2.5.1)  is  possible  this  condition 

is  close  to  being  necessary  for  Theorem  2.5.2.  Indeed  if  rn  log  n  ->  7  >  0 

and  u  =  x/a„  +  b  (with  a  ,  b  given  by  (1.2.5))  then  the  time  normalized 
n  n  n  n  n 

point  processes  of  exceedances  converge  in  distribution  to  a  certain  doubly 
stochastic  Poisson  Process.  This  leads  to  the  asymptotic  distribution  of  the 
maximum  given  by  the  convolution  of  a  normal  and  Type  1  extreme  value 
distribution.  (See  [55  Sec.  6.5]  for  details).  Further,  Mittal  and  Ylvisaker 
([64])  have  shewn  that  if  r^  *  0  but  rn  log  n  then  Mn  has  an  asymptotic 
normal  distribution.  Thus  in  these  "highly  dependent"  cases  where  D(u  )  fails 
the  classical  theory  no  longer  applies. 


cCrrfr.i’ifl.rf.r 


As  noted  previously  stationarity  has  been  assumed  in  many  of  the 
results  to  avoid  the  complications  of  notation  and  calculation  which  a 
nonstationary  framework  entails.  For  normal  sequences,  however,  the 
sufficient  correlation  conditions  still  remain  quite  simple  in  nonstationary 
cases.  For  example  the  fol lowing  result  holds. 

Theorem  2.5.3  Suppose  that  (£n)  is  a  normal  sequence  with  correlations  r^ 

satisfying  r.  .  i  p  .  .  for  i;j  where  p  <1  for  all  n  and  p  log  n  ■*  0  as 
l]  1-3  J  rn  n  J 

rn os.  Let  u  .  (is  is n,  n=l,2, . . . )  be  constants  such  that  X  =  min  u  .  > 
ni  n  ni 

n 

c  (log  n)  1 < 2  for  some  c>0.  If  for  some  no,  1(1-  (u  .))  •*  7,  then 
n  1  rLL 

P{  s  u  . )]  ->  e*T  as  n-»®. 

i=l 

Theorem  2.5.3  has  a  very  useful  corollary  in  the  case  where  a  sequence 
{ n n }  is  obtained  from  a  stationary  normal  sequence  { ( n)  by  adding  a  varying 
mean  -  such  as  a  seasonal  component  or  trend.  Calculations  then  show  that  the 
double  exponential  limit  for  the  maximum  still  holds,  but  the  normalizing 
constant  bn  can  require  an  appropriate  modification.  Specifically  suppose 
that  +  nu  where  {£^}  is  a  standard  (zero  mean  unit  variance)  normal 

sequence  (not  necessarily  covariance  stationary)  and  cure  added 
deterministic  components  with  the  property  that 

(2.5.4)  6  =  max  |m.  \  =  o(log  n)  as  n  ->  c?. 

n  lsi<n  1 

,  ,  ,  ,  ★ 
Under  this  condition  it  may  be  shown  that  a  sequence  of  constants  (mn) 

may  be  found  such  that 

(2.5.5)  J  I  exp(a*(mi  -  n£)  -  j  (im  -  m*)2)  ->  1  asn-> 

*  ... 
in  which  a  =  a  -  log  log  n  /(2a  ) .  With  this  notation,  the  following  result 


holds. 


Ihjjorejfl  2.5.4  Let  *)^  =  f^  +  ro^as  above  where  {fnS  is  a  standard  normal 
sequence  with  correlations  r^  satisfying  [  r  i  j  |  <  Pjj^,  for  i£j  with  <  1 
and  Pn  log  n  ^  0.  Suppose  that  (2.5.4)  holds  and  m^  satisfies  (2.5.5).  Then 
=  max  (1>i/1>2'.’.t|n)  satisf^-es 

P{an(Mn  "  bn  _  V  -  x)  4  ^  (“e_X) 

with  an  and  bn  given  by  (1.2.5). 

Thus  the  non-stationarity  in  the  correlation  structure  has  no  effect 
on  the  limit  law,  and  that  introduced  by  the  added  deterministic  component 
is  adjusted  for  by  the  change  of  bn  to  (bn  +  it£)  .  For  details  see  [55, 

Chapter  6] . 

Normal  processes  provide  a  widely  used  source  of  models  for  describing 
physical  phenomena,  and  it  is  gratifying  that  extremal  theory  applies  so 
simply  to  them.  Another  convenient  source  of  models  is  of  course  Markov 
chains,  whose  extremal  behavior  we  discuss  next. 


2.6  Regenerative  and  Markov  sequences 

Most  limit  results  for  Markov  chains  are  intimately  tied  to  the  theory 
of  regenerative  processes.  For  extreme  values,  this  has  been  used  in  [2], 

[91,  some  further  references  on  extremes  of  Markov  chains  being  [10],  [12], 
[68],  The  "classical"  case,  exemplified  by  the  GI/G/1  queue,  is  when  a 
recurrent  atom  exists.  However,  recently  regeneration  techniques  have  been 
extended,  in  [6],  [7],  [67],  to  shew  that  any  Harris  recurrent  chain  (^n)  on 
a  general  state  space  is  regenerative  or  1 -dependent  regenerative  (concepts  to 
be  defined  below) ,  and  to  give  effective  criteria  for  regeneration.  Further, 


■If  feSSSWfl* 


clearly  a  function  £  =f (f  )  of  a  (1-dependent)  regenerative  sequence  is  (1- 
dependent)  regenerative.  An  example  where  this  added  generality  is  useful  is 
given  by  ARMA  (p,q)  -  processes.  They  are  naturally  considered  as  functions 
of  a  Markov  chain  in  can  be  shown  to  be  1 -dependent  regenerative 

under  weak  conditions  but  usually  not  to  be  regenerative  (cf.  (79]) . 

Regenerative  and  1-dependent  regenerative  sequences  are  strongly 
mixing,  and  hence  the  theory  from  Sections  2. 1-2. 4  applies,  in  particular  the 
Extremal  Types  Theorem  and  the  Compound  Poisson  limit  for  exceedances  hold. 
However,  this  can  also  be  obtained  directly,  and  the  direct  approach  gives 
some  added  insight,  also  into  the  results  for  general  stationary  sequences. 

In  the  present  section  this  will  be  briefly  outlined,  along  with  some  results 
directly  tailored  to  Markov  chains. 

A  sequence  { £  t=l,2, . . . >  is  regenerative  if  there  exists  integer¬ 

valued  random  variables  0  <  SQ<S1  <  . . .  which  divide  the  sequence  into 
"cycles" 

°o  -  <£n:  0  5  n  <  so»'  ci  -  (£n;  so  5  n  <  V  c2  *  <£n:  sa  s  n  *  V 

which  are  independent  arvd  such  that  in  addition  c^c^ , . . .  follow  the  same 
probability  law.  Then  (5^)  is  a  renewal  process,  i.e.  Tq=Sq,  T^S^-Sq,  T2=S2~ 
S^, . . .  are  independent  and  T^T^...  have  the  same  distribution.  We  shall 
here  assume  that  m=ET1<cD  and  that  the  distribution  of  T^  is  aperiodic,  i.e. 
that  the  only  integer  for  which  P(T^  t  {d,2d, . . .  ))=1  is  d=l.  The  sequence 

{ £  }  is  1-dependent  regenerative  if  there  exists  a  renewal  process  {S,  )  as 
n  k 

above,  which  makes  Cg,^,...  1-dependent  and  c^ ,  c2 , . . .  stationary. 

Suppose  now  that  {£n:  n=0,l,...)  is  a  stationary  regenerative 
sequence,  let  ,f0=max  0  <  i  <  S  ),  ^  =  max  (f  SQ  <  i  <  s1>^  f 2  =  Tnax 

<  i  <  S2),  ...  be  the  cycle  maxima  and  define  vt=inf{kil:  S^>t } .  By 
the  law  of  large  numbers  y^/t  -»  1/m  a.s.  and  Mn  =  max  ...,£n)  is  easily 


approximated  by  max  >  ),  which  then  in  turn  can  be  approximated  by 

max  {  f  . . . ,  f  /m  } .  Since  f  f 2, . . .  are  i.  i.d. ,  this  can  be  shown  to  lead 


(2.6.1] 


sup  P(Mn;x)  -  G  (x)  I  ->  o, 


as  n  -»  co, 


rith  G (x)  =  P(s  i  x) 1//m,  see  e.g.  [79],  Since  G  is  a  d.f.  it  follows  at 


once  that  the  Extremal  Types  Theorem  holds  for  (£n)/  and  criteria  for  domains 


of  attraction  are  obtained  by  applying  the  criteria  for  i.i.d.  variables  to 


G(x) . 


In  particular  it  follows  from  (2.6.1)  and  Theorem  1.2.5  that  if  n(l- 


G(u  ))  -*  v  then 


(2.6.2) 


P(Mn  5  V  ■*  e"'1' 


cis  n  co  f 


and  conversely  if  (2.6.2)  holds  then  n(l-G(u  ))  ->  t,.  As  in  Section  2.2  let 


r  r 

? 1 ' S 2 ' . .  *  the  associated  independent  sequence  which  has  the  same  marginal 

d.f.  F  as  £ £ 2, . . .  and  write  =  max  { {^  . . . ,  {■  }.  If  in  addition  n(l- 
F(un))=nP(£1  >  un)  -»  t  >  0  then  P^  S  un)  ->  e~ 7  and  {£^}  hence  has  extremal 
index  0  =  tj/r .  Since  l-G(un)  ~  P(f^  >  u^)/m,  this  can  be  formulated  as 


follows.  If  there  exists  a  sequence  (un)  such  that  n(l-F(un))  -»  t  >  0  and 


(2.6.3) 


P(fl  >  un)/m 


P«1  >  V 


then  (ft)  has  extremal  index  0>O.  In  the  same  way  it  can  be  seen  that 
conversely  if  {f  }  has  extremal  index  8>0  then  for  any  t>0  there  exists  a 


sequence  (un)  which  satisfies  n(l-F(u  ))  -»  r  and  (2.6.3).  Further, 


straightforward  arguments  show  that  (2.6.3)  can  be  replaced  by 


(2.6.4) 


P(f1  >  x)/ro 


lim  - ± - 

XTXp  P(£  >  X) 


’  .*•  •  J \- . v  'j  .v*  r3aM..\A.V >  ■>  >  ^ 


L',V^ 

•  V-  J 


■m 

'%-\N 

>  ■* »  *v%  ••  J 
\  “-VI 
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in  this,  with  x^  the  righthand  endpoint  of  the  d.f.  F.  However,  it  should 

be  noted  that  there  are  examples  of  regenerative  sequences  {^}  which  satisfy 

(2.6.2),  even  for  u  =u  (x)  =  x/a  +b  ,  for  all  x,  but  for  which 
n  n  n  n 

(P(f  >un)/m)/P(  i  1  >  un)  does  not  converge,  and  hence  the  extremal  index  does 
not  exist,  even  if  this  is  not  expected  to  occur  in  cases  of  practical 
interest. 

A  counterpart  to  the  Compound  Poisson  limit  Theorem  2.4.3  for  the 
exceedance  point  process  Nn  given  by  Nn(E)  =  #  e  E:  { ^  >  u^) ,  is  also  easy 

i 

to  obtain  for  stationary  regenerative  sequences.  Let  N  be  the  point  process 
on  (0,1]  which  has  points  of  multiplicity  7^  =  #{t:  ?t>un/  -  t  <  }  at 

•  i 

i/n  for  each  i  for  which  7 .  >  0,  i.e.  N  is  defined  by  N  (E)  =  I  7.. 

a  in  n  i/nfE 

Then  (7^}^_  is  an  i.i.d.  sequence,  and  if  (2.6.2)  holds  so  that 
nP(7  >  0)  =  nP((1  >  un)  •*  r)m  and  if 

(2.6.5)  Hn(i)  *  P(71  =  ii  71  >  0)  ■*  Ti(i),  as  mcr, 

I 

for  all  i,  for  some  { rr  (i)  ;  i=l,2,...}  then  it  follows  at  once  that  Nn 

I 

converges  in  distribution  to  a  Compound  Poisson  process  N  with  Laplace 

1  °  -f(t)i 

transform  L^,  (f)=exp(-T)mjQ  (1-  e  ''  '  m(i))dt).  By  definition,  a  non¬ 

zero  7 ^  corresponds  to  a  cluster  of  f.  exceedances  of  u^  by  ^  for  S^_1it<S., 
and  since  S^/i  -*  m  as  i  ->  a>  there  is  hence  a  cluster  of  b  points  located 
approximately  at  mi/n  in  N  .  Hence  for  an  interval  E,  Hn(E)  is  approximated 

i 

by  II  (m  •  E) ,  (for  m  -E  -  (x:  mx  e  E})  and  asymptotically  Nn(E)  should  have 

1  -1 

the  same  distribution  as  N  (m  E)  .  This  argument  can  easily  be  extended  and 
made  stringent,  to  give  the  following  result. 

Theorem  2,6.1  Let  n=0,l,...}  be  a  stationary  aperiodic  regenerative 

sequence  with  m  <  ■=■  which  satisfies  (2.6.4)  and  let  (u^)  be  constants  such 
that  (2.2.1),  i.e.  n(l-F(u  ))  -  r,  and  (2.6.5)  hold.  Then  Nn  converges  in 
distribution  to  a  Compound  Poisson  Process  H  with  Laplace  transform  (2.4.4) , 


r  (f)  =  exp  (-9  t  ;  (1  -  I  e  M  ;  m(i))  dt}. 

N  0  i=l 

These  results  may  also  be  extended  to  1-dependent  regenerative  sequences, 
however  with  some  extra  complexity.  Here  we  mention  that  the  criterion  (2.6.4)  for 
the  extremal  index  to  be  one  then  is  replaced  (cf.  [78])  by 

P(f,  2  x,  U  -  x)/m 

(2.6.6)  lim  - - - - -  =  0. 

x'Xp  P(f1  >  x) 

In  [79],  (2.6.6)  is  further  used  to  find  conditions  for  0=1  for  a 
function  £t=f(*i.  )  of  a  Markov  chain  on  a  general  state  space.  This  result  is 
expressed  directly  in  terms  of  the  transition  probabilities 
Pn(x)=P(f(n;L)>un  ij0=x)  =  P ( f  1>un |  t>0  =  x)  as  follows. 

Theorem  2.6.2  Let  (in)  be  a  stationary  regenerative  Markov  chain  with  the 

cycle  length  T^  aperiodic  and  satisfying  E  T^a  <  ©,  for  some  a>  1.  If 

u  =u  (r)  satisfies  (2.2.1)  for  some  r>0  and 
n  n 

E(P  (ri0)S)  n1  +  s7a.  ->  o  as  n  -*  © 

for  some  s>l  with  1/a  +  1/s  <  1  then  {?  )  has  extremal  index  9=1. 

We  also  refer  to  [79,  Theorem  4.1]  and  [68,  Theorem  2.1]  for 
additional  results  on  the  extremal  index  and  Compound  Poisson  Convergence,  for 
general  distributionally  mixing  sequences,  in  a  form  which  is  particularly 
convenient  for  applications  to  Markov  chains.  Finally  the  restriction  that 
the  Markov  chain  (or  regenerative  sequence)  is  started  with  the  stationary 
initial  distribution  is  not  essential.  All  the  results  hold  for  arbitrary 
initial  distributions,  provided  only  that 

P(f  >  max  -*  0  as  k  -  co. 


2.7  Moving  averages 


Here,  a  stationary  sequence  { £^}  is  a  moving  average  if  it  can  be 


? 


written  in  the  form 


(2.7.1)  £  =  l  t  =  0,±l,..., 

i=-co 

where  (£t)  is  an  i.i.d.  sequence  (the  "noise  sequence")  and  {c^}  is  a  sequence 
of  constants  (the  "weights")  and  where  the  sums  are  assumed  to  converge  with 
probability  one.  If  a  stationary  normal  sequence  has  a  spectral  density  - 
this  holds  e.g.  if  Erfc  <  ®,  it  can  be  represented  in  a  non-unique  way, 
as  a  moving  average  with  normally  distributed  £  's.  Further,  (2.7.1)  includes 
the  ARMA-processes  (which  satisfy  a  finite  linear  difference  equation  in  the 
£ '  s  and  hence  are  multi-dimensional  Markov  chains) ,  which  are  extensively 
used  in  time  series  analysis.  Thus,  in  particular,  some  of  the  themes  from 
Sections  2.5  and  2.6  will  be  taken  up  again  here,  but  from  a  slightly 
different  point  of  view. 

The  extremal  behavior  of  (£^)  depends  on  both  the  weights  and  the  two 
tails  of  the  marginal  d.f.  of  the  noise  variables  in  an  intricate  and 
interesting  way.  To  reduce  the  amount  of  detail,  we  shall  only  describe  the 
asymptotic  distribution  of  the  maxima,  for  the  case  of  non-negative  c^'s. 

The  general  case  involves  some  extra  complexity,  since  then  an  extreme 
negative  noise  variable  which  is  multiplied  by  a  negative  c^  may  contribute  to 
a  large  £t~value.  In  addition  to  this,  the  references  cited  belcw  prove 
point  process  convergence  and  give  rather  detailed  information  on  the  sample 
path  behavior  near  extremes,  including  the  clustering  which  occurs  when  the 
extremal  index  is  less  than  one.  Here  we  will  only  exhibit  the  limiting  form 
of  the  sairple  paths  near  extreme  values  without  going  into  technicalities, 
referring  to  [75],  [76],  [29]  for  further  details. 


S'W<AjSv,sv.-.y.‘  AW.yv  v„-\  -’.'V-V-VV-V- 
*■  ',“*1  "  •  *  v  v.vvvv-i v,-  v v/ w vr/ 


,%'V  ‘.VvV.vV-*  •/  *.* 


*-  A  *.  %  \ 


In  cases  when  (1.2.1)  holds,  i.e.  when 

d 

(2.7.2)  p{an^n~bn^  5  G(x)'  as  n  -*  ®, 

the  asymptotic  behavior  of  the  maximum  is  specified  by  the  constants  an  >  0,bn 
and  the  d.f.  G.  However,  this  involves  an  arbitrary  choice,  since  if 

I 

a  ,  b  are  replaced  by  a',b',  where  a  /a'  -»  a  >  0  and  a  (b  -  b  )  ->  b,  then 
n  n  F  ■‘nn  rrn  n  n 

(2.7.2)  still  holds,  but  with  G(x)  replaced  by  G(ax  +  b) .  In  the  sequel  we 
will  keep  the  G's  fixed,  as  the  standard  d.f. 's  displayed  in  Theorem  1.2.1  and 
hence  describe  extremal  behavior  by  an,  b^  and  the  type  of  G. 

The  effect  of  dependence  on  extremal  behavior  can  be  further 

understood  by  comparing  with  extremes  of  the  noise  sequence  and  of  the 

associated  i.i.d.  sequence  (ft)  with  the  same  marginal  d.f.  as  the  moving 

average  (ft)-  Specifically,  for  =  maxj^, . . . ,  f  )  and  . . . ,  fn)  there 

are  norming  constants  a  ,  a„>  0  and  b  .  b  such  that  for  the  cases  we 

n  n  n  n 

consider  here, 

(2.7.3)  P<an(Sh  "  V  S  X}  "  GM 
and 

(2.7.4)  p<an(”n  "  bn}  5  x}  4  G(x)' 

with  the  same  G  as  in  (2.7.2),  and  we  shall  indicate  the  relations  between  the 
different  norming  constants. 

The  articles  by  Rootzen  ([75])  and  Davis  and  Resnick  ([29])  are 
concerned  with  noise  variables  which  are  in  the  domain  of  attraction  of  the 
type  II  extreme  value  distribution,  or  equivalently  when  the  noise  variables 
have  a  regularly  varying  tail, 

(2.7.5)  P(fQ  >  X)  =  x-«L(x), 

with  a  >  0,  and  L  slowly  varying  at  infinity.  Hence,  using  the  prescription 
for  norming  constants  given  after  Theorem  1.2.5,  if  7n  satisfies  P (fQ  <  7n)  £ 
1-1/n  <  P(rQ  £  7 n) ,  so  that  7^  is  roughly  of  the  order  n1/a,  then  (2.7.3) 


holds,  with 


G(x)  =  exp(-x-<*)  , 

i 


x  >  0 . 


Let  c+  =  max  {c^;  i  =  0, +!,...}•  Then  also  (2.7.2)  is  satisfied,  with 


lG(x)  =  exp(-x_a)  . 

This  is  elegantly  proved  in  [29],  by  first  noting  that  complete  Poisson 
convergence  of  extremes  of  the  f  -sequence  is  immediate  (cf.  Section  1.3)  and 
then  obtaining  the  corresponding  result  for  the  £  's  by  a  "continuous  mapping" 
and  approximation  argument.  [29]  uses  same  summability  assumptions  on  the 
c^'s,  and  for  convenience  that  c^=0  for  i=-l,-2, . . .  .  However,  it  seems  clear 
that  the  results  hold  without  any  restrictions  beyond  the  assumption  that  the 
sums  in  (2.7.1)  converge,  cf.  [75]. 

An  intuitive  explanation  of  (2.7.6)  is  that  when  the  tails  of  the 
noise  variables  decrease  slowly,  as  in  (2.7.5),  then  the  extreme  noise  values 
cue  very  much  larger  than  the  typical  ones,  and  that  hence  the  maximal 
value  asymptotically  is  achieved  when  the  largest  ^-value  is  multiplied  by 
the  largest  weight,  c  .  This  of  course  agrees  with  (2.7.6),  since  the  norming 
constants  there  are  the  same  as  those  which  apply  to  max{c+f ^, . . . ,c+fn) . 

These  heuristic  arguments  also  easily  lead  to  the  following  form  of  the 
normalized  sample  path  ft  +  t/£t  near  an  extreme  value  at,  say,  the  time 
point  t ;  asymptotically  this  ratio  has  the  same  distribution  as  the  function 
yt  given  by 

(2.7.7)  yt  =  uc_t, 

where  U  is  a  certain  random  variable  with  values  in  the  set  { . . ,l/c_^,l/c0,l/c. 
Thus,  except  for  a  random  height,  sample  paths  near  extremes  are 


asymptotically  deterministic. 

The  special  case  of  (2.7.5)  when  the  noise  variables  are  stable  (or 
"sum-stable",  as  opposed  to  max-stable)  was  studied  first,  in  [75].  It  has 
the  appealing  feature  that  then  also  the  moving  average,  and  indeed  all  linear 
functions  of  the  noise  variables  are  jointly  stable.  For  such  variables,  it 
is  easily  seen  that  (2.7.4)  holds,  with 

an  -  Jn,  bn  -  0 

G(x)  =  exp(-x-<M, 
and  hence  also  that  the  extremal  index  is  c^/lc& ,  for  the  case  of 
non-negative  c's  discussed  here.  Although  not  considered  in  [29],  this  can  be 
shown  to  hold  also  for  the  general  case  (2.7.5) ,  provided  the  sums  involved 
are  convergent. 

The  other  class  of  moving  averages  which  has  been  studied,  in  [76] , 
is  specified  by 

(2.7.8)  P(fQ  >  x)  ~  Kxae  as  x  ■>  t», 

where  K,p  >  0  and  a  are  constants.  Again  it  follows,  using  Theorem  1.2.5, 
that  (2.7.3)  holds,  with 

an  =  P(log  n)1-1/p 

t>n  =  (log  n)1//p  +  p_1((a/p)log  log  n  +  log  K)  (log  n)1//p_1 
1  G  ( x)  =  exp  {-e-x}. 

Thus  the  center  of  the  distribution  of  tends  to  infinity  roughly  as 
(log  n) 1//p,  and  the  "scale  parameter"  a”*  is  of  the  order  (log  n)1//p-1,  which 
shows  that  for  p>l  the  distribution  of  becomes  more  and  more  concentrated 
asn->  co,  and  that  it  becomes  increasingly  spread  out  for  0<p<l,  while  the 
order  of  the  scale  does  not  change  for  p=l.  As  we  shall  see,  the  same  holds 
for  and  Mn> 

The  case  when  (2.7.8)  holds  with  p=l  leads  to  intermediate  behavior, 


and  we  will  only  discuss  the  remaining  cases.  For  0<p<l  again  a  large  ?- 
value  is  caused  by  just  one  large  noise  variable,  in  a  similar  way  to  the 
behavior  when  (2.7.5)  holds.  However,  the  non-zero  bn-terms  cause  same  extra 
complications.  Thus,  (2.7.2)  holds  with 


a  =  c  a  ,  b  =  c  b 
n  +  n  n  +  n 

G(x)  =  axp(-e-x) , 


in  analogy  with  (2.7.6),  but,  writing  k  for  the  number  of  i*s  with  c^=c+,  the 
appropriate  version  of  (2.7.4)  involves 


j  an  ‘  <  V  bn  "  =+(V(l0,3  k)/an> 
l  G(x)  =  exp  {-e  x}. 

Also  the  asymptotic  form  of  the  sample  path  E  t - t/ E  t  near  an  extreme  value  at 
r  is  similar.  For  k  =  1  it  is  given  by  the  deterministic  function 

yt  =  WV 

while  in  the  general  case  it  is  a  random  translate  of  this. 

The  case  when  (2.7.8)  holds  with  p>l  is  more  intricate,  since  then  an 
extreme  E -value  is  caused  by  many  moderately  large  noise  variables  in 
conjunction,  and  since  extremal  behavior  is  determined  by  the  constant  |  | c  |  | 

=  ( 1 1  c  .  |  ^) 1//q  and  the  function 


(2.7.9) 


yt  =  “ci-t  C^P/I  !cl  lq' 
i  i  q 


with  q=(l-l/p)  .  In  fact,  the  normalized  sample  path  Et+r/Er  near  an 

extreme  at  r  asymptotically  has  the  deterministic  form  (2.7.9),  and  (2.7.2) 
and  (2.7.4)  hold,  with 


(2.7.10) 


=  a„  = 
n  n 


cj  I"1  a  ,  b  =  b 
1  q  n  n  n 


G(x)  =  exp  {-e  ) 


Here  bn  =  bn  is  not  determined  by  (2.7.8)  alone,  except  for  finite  moving 


,•  V‘/>%  ■ 
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averages,  it  is  also  influenced  by  the  center  of  the  distribution  of  the  £'s. 

However,  it  is  roughly  of  the  order  | |c|  |  b  ,  but  still 

c[  n 

anl  (bn“l  i  c  1  I  g  bn)  I  may  in  general  tend  to  infinity.  It  of  course  follows  at 
once  from  (2.7.10)  that  the  extremal  index  is  one  for  p>l. 

For  p=q=2,  which  includes  the  normal  case,  (2.7.9)  is  the  correlation 
function  and  |  |  c  j  |  is  proportioned  to  the  standard  deviation,  in  agreement 
with  Section  2.5.  However,  it  is  interesting  to  note  that  for  p?2 
covariances  seem  to  have  little  bearing  on  extremes. 

The  results  for  the  case  (2.7.8)  use  the  assumption  that 
|  c  .  |  *0  (  ]  i  |  ®) ,  for  some  9  >  max  (1,2/q),  and  for  p>l  in  addition  a  number  of 
smoothness  restrictions  on  the  distribution  of  the  noise  variables.  These  are 
mainly  used  in  the  derivations  of  the  behavior  of  the  tail  of  £0=Ec^f_^, 
which  for  p>l  is  the  main  difficulty,  cf.  [78].  It  is  fairly  easy  to  see  that 
D(un)  holds  for  all  the  moving  averages  considered  here,  and  the  results  above 
for  p>l  are  obtained  along  the  lines  set  out  in  Section  2.2,  by  verifying 
(2.2.2) .  For  0<pil,  i.e.  in  the  cases  when  0  may  be  less  than  one,  the 
proofs  use  ad  hoc  methods,  closely  related  to  the  heuristic  arguments  given 
above. 

Finally  it  should  be  mentioned  that  Finster  ([33])  obtains  some 
related  results  using  autoregressive  representations  of  the  processes,  and 
that  Chemick  ([18])  provides  an  example  with  qualitatively  different 
behavior. 

2.8  Rates  of  convergence 

Rates  of  convergence  for  the  distribution  of  the  maximum  have  mainly 
been  studied  for  i.i.d.  variables.  In  the  present  section  we  briefly  review 
this  work,  discussing  in  turn  pointwise  rates,  uniform  convergence  of  d.f.'s. 


so  called  "penultimate"  approximations,  uniform  convergence  over  the  class  of 
all  sets,  and  "large  deviation"  type  results.  Although  generalizations  seem 
straightforward,  the  only  dependent  sequences  which  have  been  considered  are 
the  normal  ones.  The  quite  precise  results  available  for  this  case  are 
discussed  at  the  end  of  the  section.  A  useful  general  observation ,  which 
applies  to  i.i.d.  and  dependent  cases  with  extremal  index  0=1,  is  that  once 
rates  of  convergence  of  the  maximum  have  been  found,  then  it  is  typically 
quite  easy  to  find  similar  rates  for  order  statistics. 

For  i.i.d.  random  variables  and  a  given  u  ,  the  error  P(M^  £  un)-e~T 
in  the  approximation  (1.2.3)  is  easy  to  compute  directly,  since  then 
P(Mn<un)=Fn(un) ,  where  F  is  the  common  d.f.  of  the  variables.  Further  if  F 
is  continuous  one  can  always  make  the  difference  zero  for  any  n,r>0  (by 
taking  un=F  (e  '  ) ) .  However,  often  u^  is  determined  from  other 
considerations,  e.g.  in  (1.2.1)  it  is  chosen  as  un==un(x)=V3n+k’n  31x3  corre- 
pondingly  t=t  (x)=-log  G(x) .  Then  the  behavior  of  the  approximation  error 

An(x)=P(Mh-un(x))  _e"T(x>' 
perhaps  over  a  range  of  x-values,  and  in  particular  of 

dn(an'bn)  =suP!An(x^  =  sup!  p(anWn"bn,-x)  -  G(x)  | 

X  X 

is  less  immediate.  If  (1.2.1)  is  used  as  an  approximation  or,  more 

importantly  if  it  motivates  statistical  procedures,  when  an,  bn  have  to  be 

estimated,  interest  centers  on  which  rate  of  decrease  is  attainable  when  the 

"best"  a  ,  b  are  used,  i.e.  on 
n  n 

d  =inf  d  (a,b)  =  inf  sup  i,  P{a(M_ -b)  £x)  -  G(x)  !  . 
n  a>0,b  n  a>0,b  x  J‘ 

It  is  easy  to  give  examples  of  distributions  F  for  which  dn  tends  to  zero 
arbitrarily  slcwly,  and  to  any  exponential  rate  there  is  an  F  which  achieves 
this  rate.  Hcwever  faster  than  exponential  decrease  of  dn  implies  that  F  is 


max-stable,  and  then  dn=0  for  all  n,  [8],  [77].  Also  different  standard 
distributions  give  quite  different  rates,  e.g.  for  the  normal  distribution  dn 
is  of  the  order  1/log  n  while  for  the  uniform  and  exponential  distributions 
the  order  is  1/n. 

Let  rn  =  rn(x)  =  n(l-F(un(x) ) ) .  In  the  sequel  we  will  usually,  for 

brevity,  delete  the  explicit  dependence  on  x.  An  obvious  approach  to 

analysing  a  (=a  (x) )  in  the  i.i.d.  case  is  to  introduce 
n  n 

*  n  Tn  ”  Tn 

A  n=  ^  -e‘T' 

so  that 

(2.8.1)  |aj  =  |  F(u^)n  -  e~  T  |  =  I  (i“Tn/n)n  -  e- T  |  <  |  a  'J  +  |  a"J  . 

Here  0<rn<n,  and  for  such  values  the  satisfying  uniform  bound 


(2.8.2) 


a'J  £  n  1(a  +  n  1)e  2 


is  derived  by  Hall  &  Wellner  ([46]).  Further,  for  fixed  t,  by  Taylor's  formula 


(2.8.3) 


'"■J  -  e-r  \rn-r\, 


as  r  -»r.  However,  (2.8.3)  is  only  uniform  for  r  -r  =  r  (x)  -  t  (x)  in 
n  n  n 

intervals  which  are  bounded  from  below,  and  to  bound  d^  a  further  argument  has 

to  be  added.  Often  this  runs  as  follows;  (2.8.2)  and  (2.8.3)  give  sharp 

estimates  of  sup  |  a  (x)  |  ,  for  any  a  >  xQ,  where  xQ  is  the  left-hand  endpoint 
x>a  n  u  u 

of  the  d.f.  G,  and  then  also  for  sup  j  a  (x)  |  if  x  is  taken  to  converge 

^n  n 

to  xQ  suitably  slowly.  Combining  this  with 
(2.8.4)  sup  |a  (x)|  <  max  (Fri(x  /a  +b  ) ,  G(xn)  } 

X£Xn 

leads  to  a  bound  for  d  (a  ,  b  ) ,  and  then  by  varying  a  ,  b  ,  to  bounds  for  d  . 

n  n  n  n  n  n 

This  approach  is  used,  with  same  variations,  by  Hall  &  Wellner  ([46]),  Davis 
([26]),  Cohen  ([19]  [20]),  and  Leadbetter  et  al.  [55]).  Here  the  bounds 
corresponding  to  (2.8.2)  and  (2.8.3)  are  asymptotically  sharp,  but  there  is  a 
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possibility  that  and  can  at  least  partially  cancel.  However,  this 
happens  only  if  r  *  t  -  r2/( 2n)  +  o(l/n) ,  and  hence  in  fairly  special 
cases,  as  is  readily  seen  (cf.  Davis  ([26])). 

A  number  of  papers,  seme  of  the  later  references  being  Cohen 
([19] [20]),  Smith  ([82]),  and  Resnick  ([73]),  have  introduced  conditions  which 
permit  more  explicit  bounds  than  (2.8.1)  -  (2.8.4)  to  be  calculated.  Their 
approach  is  to  take  some  set  of  conditions  for  attraction  to  an  extreme  value 
distribution,  typically  involving  convergence  of  seme  quantity  related  to  the 
tail  of  F,  and  show  that  if  this  holds  at  a  specific  rate  then  d  (an,bn) 
converges  at  a  corresponding  rate.  In  this  a  set  of  simple  sufficient 
conditions  due  to  von  Mises  ([63])  (cf.  [55],  p.  16)  have  been  particularly 
useful.  There  are  many  possible  versions  of  such  conditions,  and  hence  many 
partially  overlapping  results  have  been  obtained.  As  a  typical  example  we 
cite  the  following  result  of  Resnick  ([73]). 

Suppose  F  is  differentiable  and  that  there  exists  a  continuous 
function  g  which  tends  monotonically  to  zero  and  which  satisfies 

<2-8-5'  fmtSw  -*1  !5M’  x>0' 

for  same  a>0.  Then,  if  afi  is  chosen  to  satisfy  -log  F(^ 1)  =  n  \ 

sup  iF^x/aJ  -  exp{-x-a)  |  <  .2701  g(a:1)/(a-g(a'1) ) , 
x>l  n 

for  n  such  that  gfa”1)  <  a.  Here  (2.8.5)  is  a  slight  variation  of  von  Mises’ 
condition  for  attraction  to  the  type  II  extreme  value  distribution,  and  the 
proof  is  somewhat  different  from  the  method  outlined  above,  the  main 
ingredient  being  an  estimate  of  -log  (-log  F(x)).  Resnick  also  obtains  a 
somewhat  more  complicated  bound  for  the  supremum  dn(an,0)  over  all  x. 

For  i.i.d.  variables  bounds  on  the  rate  of  convergence  of  the  maximum 


automatically  lead  to  bounds  for  the 
values.  This  follows  by  using  (1.2.7) 
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([43]),  and  gives  the  rate  of  convergence  l/(log  nj  for  a=2,  while  other 
values  of  a  lead  to  the  same  order  1/log  n  as  for  itself.  Numerical 
computations  show  that  these  approximations  also  do  better  for  small  and 
moderate  values  of  n,  as  could  be  expected. 

A  further  statistically  relevant  question  is  to  find  rates  of  uniform 
convergence,  i.e.  to  bound 

d'  =  inf  sup  |  P(a  (M  -b  )  eB)  -  G(B)  | 
n  a>0,b  B  eB  n  n  n 

where  B  denotes  the  Borel  sets  in  R ,  and  G(B)  is  the  probability  that  a 
random  variable  with  d.f.  G  belongs  to  B.  The  obvious  approach  is  to  bound 
the  difference  between  the  density  (which  is  assumed  to  exist)  of 
and  G'.  Let  G' (x)  =  G(x)7(x),  so  that  7(x)  =  e  x,  ax*0'1,  and  af-x)0-'1 
for  the  type  I,  II  and  III  extreme  value  distributions,  respectively.  Since 
(for  i.i.d.  variables), 

h  P<an<Vbn'  "  h  f’'<*Vbn>  “  F<^an  +  V""*  ra'nF'  <*/an+  V  - 


where  the  first  factor  tends  to  G  at  a  rate  given  by  the  references  cited 
above,  the  main  problem  is  to  bound  the  difference  nan  F' (x/an  +  k>n)  “  >  (x)  • 
The  recent  thesis  by  Falk  ([32])  contains  a  survey  of  results  in  this 
direction,  some  further  recent  work  being  that  of  de  Haan  &  Resnick  ([42])  and 
Weissman  ([86]). 

Another  problem  which  has  attracted  some  attention,  partly  because  of 
reliability  applications,  is  the  uniformity  of  the  convergence  of 
P(an(Mn-bn)>x)/(l-G(x) )  for  large  x;  see  Anderson  ([2])  and  de  Haan  &  Hordijk 


([40]). 


For  a  stationary  dependent  sequence  with  extremal  index  6=1,  a 


further  source  of  error  is  the  approximation  by  the  associated  independent 
sequence,  i.e.  the  difference 


'X 

yv. .. 


9 


4' '  <*>  -  p<an<Vbn>  s  *>  - 

where  F  is  the  marginal  d.f.  of  the  sequence.  Cohen  ([19])  shows,  under  weak 

I  I  I 

covariance  conditions,  that  for  a  stationary  normal  sequence  is 

o( 1/log  n) ,  and  hence  that  the  rate  of  convergence  in  (1.2.1)  is  determined  by 
the  difference  Fn(v'an+bn)  -  G(x),  and  hence  is  the  same  as  in  the  i.i.d. 
case.  Let  p  be  the  maximal  correlation  in  the  stationary  normal  sequence. 
Rootzen  ([77])  gives  a  first  order  approximation  and  bounds  for 

(1-P)/(1+P) 

1  which  are  roughly  of  the  order  1/n  for  p  >  0. 

(k) 

By  using  an  embedding  technique,  these  rates  are  extended  also  to  IT 
and  to  point  processes  of  exceedances.  This  embedding  can  be  used  more 
generally,  and  hence  also  in  dependent  cases  rates  for  the  maximum  often 
easily  lead  to  similar  rates  for  k-th  largest  values. 

2.9  Multivariate  extremes 

We  shall  discuss  here  only  one  multivariate  problem,  the  Extremal 
Types  Theorem  for  i.i.d.  random  vectors,  and  its  extension  to  dependent 
sequences.  As  shown  by  de  Haan  &  Resnick  ([41])  and  Pickands  ([72])  the 
problem  of  characterizing  the  possible  limit  laws  of  the  vector  of 
coordinatewise  maxima  splits  into  two  independent  problems,  to  find  the 
marginal  d.f. 's  which  may  occur  -  by  the  one-dimensional  result  this  is  just 
the  class  of  extreme  value  d.f. 's  -  and  to  characterize  the  limiting 
dependence  between  components.  Following  Deheuvels  ([30])  and  Hsing  ([49])  we 
will  use  the  concept  of  dependence  functions  to  discuss  this. 

Let  I  =  (f  . . .  ,{d)  be  a  d-dimensional  random  vector  with  d.f.  G  and 
marginal  d.f.'s  G ^ ,  lsjsd.  The  dependence  function  D  of  i  (or  of  G)  is 
defined  by 


D(x. , . . . ,  X , )  —  P{G.  ( { . )  i  x.  , . . . ,  G  .  ( £  ,)  -  X  . } . 


D  is  the  d.f.  of  a  distribution  on  [0,1],  and  it  has  uniform  marginal 


distributions  if  the  Gj's  are  continuous.  The  marginal  distributions  together 
with  the  dependence  function  determines  G,  since 

(2.9.1)  G(x^, . .  •  , X^j)  =  D (G^ (X^)  ,  • • • ,Gd (xd) )  /  •  •  •  > ^  71  • 

This  is  a  consequence  of  the  relation 

d 

(Gjdj)  <  G.CXj);  l<j<d}\  U^{Gj  j)  SGj(Kj),  >Xj} 

c  Uj  <  Xj;  l<j<d>  c  {Gj(fj)  <  GjCXj),  l<j<d), 
since  it  is  readily  seen  that  P(G_j(£j)  5  G^(Xj),  £  ^  >  x^)  =  0  for  each  j. 

A  further  useful  property  is  that  convergence  of  d-dimensional 
distributions  is  equivalent  to  convergence  of  the  dependence  function  and  the 
marginal  distributions,  provided  the  limit  has  continuous  marginal  d.f. 's. 

This  can  be  proved  rather  easily,  using  (2.9.1) .  Similarly  to  the  one¬ 
dimensional  case,  a  d-dimensional  d.f.  G  is  said  to  be  max-stable  if  there 

exist  constants  a  .  >  0,  b  .  ,  i=l,...,d,  such  that 
n,i  n, i 

(2.9.2)  5  (a^j,  1x1  +  bn,i'-’*'an,dXd  +  bn,d)=^(xl' * ’ * 'xd}  x1,...,xd  e 
for  each  n=l,2,...  .  Further,  a  dependence  function  D  is  max-stable  if 

(2.9.3)  Dfyx^",  ...,Xd/n)  =  D(x1, . . .  ,Xd)  ,  Xlf...,Xd  e  R, 
for  n  =  1 , 2 , . . .  . 

Theorem  2.9.1  A  d-dimensional  (d  >  2)  d.f.  with  nondegenerate  marginal 
distributions  is  max-stable  if  and  only  if  its  marginal  d.f.'s  and  its 
dependence  function  are  max-stable. 

Proof.  If  G1,...,Gd  cure  max-stable,  or  if  G  is  max-stable,  then  there  are 

constants  a  .  >  0,  b  .  with  G?(a  .x  +  b  .)  =  G. (x) ,  for  i=l,...,d.  Hence, 
n,i  n,i  l  n ,  1  n f  x  x 

in  either  case, 

(2.9.4)  5n(anjlVbnil . an,dVbn,<i>  ‘  . 0d<an,dVbn,d 

-  . Gd(xd)1/n). 


Thus  (2.9.2)  follows  at  once  if  D  is  max-stable,  by  (2.9.1).  The  converse, 
i.e.  that  fiP(yJ/n, . . . ,y^/n)  =  D(y1, . . . ,yd) ,  for  yi  e  (0,1),  i=l, . . . ,d  if  G  is 
max-stable  also  follows  from  (2.9.4),  by  talcing  x^=0^^(y^)  there  (note  that 
each  is  nondegenerate  max-stable  and  hence  continuous  and  strictly 
increasing  on  its  support) . 

CO 

bet  (in)  =  { ( f  n  d)  )n=1  be  a  sequence  of  i.i.d.  random 
vectors,  write  M  .  =  max{  f  1  . )  and  suppose  there  are  constants 

an,i>0'  bn,i  such 

(2.9.5)  2  *'  ISiSd)  G(x1 . xd), 

where  we  may  assume  without  loss  of  generality  that  the  marginal  distributions 
of  G  are  nondegenerate.  It  then  follows  exactly  as  in  the  one-dimensional 
case  that  the  possible  limits  G  in  (2.9.5)  are  precisely  the  max-stable 
d.f.  's.  Thus,  by  Theorem  2.9.1  each  marginal  d.f.  is  max-stable  and  hence  one 
of  the  three  extreme  value  types,  and  the  dependence  function  is  max-stable. 
Further  the  distribution  of  a  . (M  ,-b  . )  tends  to  G. ,  for  i=l, . . . ,d  and  the 

li/i  n^i  rifi  1 

dependence  function  of  (M^  i=l, . . . ,d)  converges  to  the  dependence  function 
of  G.  To  complete  the  characterization  of  the  limits,  it  only  remains  to 
describe  the  max-stable  dependence  functions.  Again,  this  is  a  purely 
analytical  problem,  to  solve  the  functional  equation  (2.9.3) ,  and  we  thus  only 
cite  the  result,  which  is  obtained  in  somewhat  varying  forms  in  [45],  [72], 

[30] ,  and  [49] . 

Theorem  2.9.2  A  function  p  on  [0,1]^  is  a  max-stable  dependence  function  if 

and  only  if  it  has  the  representation 

P(y,  ,...,y.)  =  exp  (I  min  (x.  log  y.)du), 

S  l<i<d 


■■r.v.  *.  ■■■  V  .  ■■  V^'\oy.V.V.V->V.V«V-hh' 


where  S  is  the  simplex  { (x^  . . .  ,xd) :  x^  >  0,  i=l,...,d,  E£xyl } ,  for  same 

finite  measure  u  on  S  which  satisfies  /x.du  =  1,  for  i=l,...fd. 

S 

Hsing,  ([49])  also  makes  the  observation  that  while  the 
characterization  of  the  limiting  marginal  d.f.'s  is  crucially  tied  to  linear 
normalizations,  this  is  not  so  for  the  dependence  function.  Specifically,  if 
(Uh  i^J  31-6  which  are  continuous  and  strictly  increasing  in  x,  and  if 

d 

P(Mn,i  5  Vi^i*'  i=1'***'d)  *  fi(x1,.../xd) 


where  G  has  continuous  marginal  distributions,  then  the  dependence  function  of 

G  is  max-stable.  The  basic  reason  for  this  is  the  obvious  fact  that  if 

T], '  *  ■  * ' Td  3X6  continuous  and  strictly  increasing,  then  ( £  . . . ,  £ d)  and 

(T1  ( ^  l) '  ‘  ’  ,Td^d^  ^  have  the  sanle  dependence  function. 

Hsing  also  extends  these  results  to  stationary  dependent  sequences 

(I  },  along  rather  similar  lines  as  for  the  one  dimensional  case,  as  treated 

in  Sections  2.1  and  2.2.  Specifically,  for  given  constants  {u^  y  j=l,...,d, 

nil}  the  condition  D(u  u  .)  is  defined  to  hold  if  there  is  a  sequence 

n,l  n,d 

1  n  =  o(n)  such  that  an  ^  ->  0  as  n-»  a>  for 

an,l  =  max  { I  P(?i,jsun,j:  j=1/  •  •  •  /d,  i  €  A  U  B) 

-p ( ? i , j  un^:  j-1, . . . ,d,  i«A)  P(fi  j=l,...,d,  i  e  B)  |  } 

where  the  maximum  is  taken  over  all  sets  A,B  such  that  Ac  (1, . . .  ,k) , 

Be  {k+1 , . . .  ,n) ,  for  same  k.  If  D(un  d)  holds  the  only  possible 

limits  in  (2.9.5)  again  are  the  max-stable  d.f.'s.  Further,  if  in  addition 
d  d  [n/k] 

(2.9.6)  lim  sup  n  I  E  E  P{£.  .  >  u.  ,  $  .  .  >  u  .  )  ■>  0  as  fe® 

iyl  i2=l  j=2  1,1  1  ,;L1  3,1  2  ,x2 

then  P(M^  ^  <  un  y  i=l,...,d)  ->  p  > 0  if  and  only  if  Pn(£1  ^  ^ 

i=l,...,d)  -»  p ,  i.e.  the  asymptotic  distribution  of  maxima  is  the  same  as  if 


the  vectors  were  independent.  ((2.9.6)  of  course  reduces  to  (2.2.2)  for  d=l) . 
A  further  question  considered  by  Hsing  is  independence  of  the 


marginals  in  the  limiting  distribution.  In  particular,  he  shews  that  if 


(2.9.7)  lim  sup  Z 


[ryk] 


l,[2. 1  A  p,fH  ’M'  ‘iu/Vi;*0- 


h*h 


as  k->a>,  and  if  0(11^  1,...,uh  d)  is  satisfied  then  (2.9.5)  holds  if  and  only 


if  ^-kn  ^)  s  x}  G^(x) ,  as  moo,  for  i=l,...,d,  and  G  then  is  of  the 

form  G(x1,...,xd)  =  G1(x]_)G2(x2)  ...  Gd(xd). 

Now  let  (f  }  be  normally  distributed  with  E£  .=0,  V(£  .  )=1  and  let 

Ii  X  /  J>  1  ^  X 

r..(n)  be  the  covariance  between  f  .  .  and  £.  ..  If  r.,(0)  <  1,  for 

1J  l'rnil  lj 

lii^jsd  and  r^  (n)  log  n  ->  0,  as  n-»  ®,  for  i,j=l,...,d,  and  u^  ^=x^/an+bn, 
with  an,bn  as  in  (1.2.5)  then  D(un  d) ,  (2.9.6),  and  (2.9.7)  are 

satisfied,  so  that  the  asymptotic  distributions  of  maxima  are  the  same  as  for 
a  sequence  of  independent  normal  vectors  with  independent  components  (see 
[49],  [1]). 

2.10  Convergence  of  sums  to  non-normal  stable  distributions. 

To  consider  the  simplest  case  first,  suppose  {£^}  is  a  sequence  of  i.i.d. 

symmetric  r.v. 's,  with  P(£^  >  x)  ~  x_a,  x  ->  ®,  for  same  a  e  (0,2),  so  that  the  { 

belong  to  the  domain  of  attraction  of  the  type  II  extreme  value  distribution,  with 

norming  constants  a_=n" l/a,  b  =0.  Let  and  m^  denote  the  k*^1  largest 
J  n  n  n  n 

and  k  respectively,  of  .  It  is  straightforward  to  shew  that  then 

(2.10.1)  lim  sup  P{  |  a  (  Z  f  .  -  I  (M^+m^)  j  >  S )  -»  0  as  r  ->  ®, 
n-»®  n  i=l  1  k=l  n  n 

for  any  a>0.  By  Theorem  1.3.2  the  joint  distribution  of  (an  Mvn k=l,...,r) 
converges,  and  the  limit  can  be  found  explicitly,  sis  in  Theorem  1.3.3.  For 
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m 


m 


example,  the  joint  asymptotic  distribution  of  and  a^^  is  as  given 


there,  with  G(x)  =  exp(-x-a).  Let  {M'  ' :  k=l,...,r}  have  this  distribution, 


so  that  {a  :  k=l,...,r}  $  (M^  :  k=l, . . . ,r} .  Similarly,  there  are 
(m^:  k=l,...,r},  with  {anm^ :  k?=l,...,r}  £  {m^  ;  k=l,...,r},  and  it  is 
easy  to  see  that  there  is  also  joint  convergence,  {a  ,  a  :  k=l, . . .  ,r} 
*  (M^ ,  m^  ;  k=l,...,r).  Hence 


an  l  (^k)+n^k))  3  I  (M(k)+m(k))  as  n  4  ». 


It  can  further  be  shown  that  I  (M^  +  ra^)  tends  in  distribution  to  a  stable 

k=l 

(or  "sum  stable",  cf.  Section  2.7),  limit  with  index  a  as  r*®,  and  then  it 

n 


follows  at  once  from  (2.10.1)  that  a  I  £  .  has  the  same  stable  limit. 

n  i-l  1 


This  illustrates  that  the  central  limit  problem  of  convergence  of  sums 


to  non-normal  stable  distributions  hinges  on  the  convergence  of  extreme  order 


statistics,  and  the  most  natural  approach  to  it  is  perhaps  via  extreme  value 


theory.  In  Theorem  2.10.1  below  this  is  made  precise.  The  theorem,  which 


builds  on  ideas  of  Durrett  and  Resnick  ([31])  and  Resnick  ([73]),  contains  a 


functional  central  limit  theorem,  and  the  corresponding  extreme  value  result 


is  the  "complete"  convergence  of  upper  and  of  lower  extremes,  which  is 


discussed  in  Sections  1.3  and  2.4.  The  corresponding  one-dimensional  approach 
via  the  joint  distribution  of  extremes,  as  sketched  above,  is  vised  in  [56]  and 


[28]  and  will  be  briefly  discussed  at  the  end  of  this  section. 


The  results  depend  essentially  on  the  Ito-Levy  representation  of  the 


stable  process,  and  we  shall  now  list  the  relevant  properties,  referring  to 


Ito  ([51],  Section  1.12)  for  proofs  and  further  information.  Let  { -n  (t) 


0<t<l}  be  a  non-normal  stable  stationary  independent  increments  process 


(briefly,  { ^  (t)  )  will  be  referred  to  as  a  stable  process) .  { n  (t) )  can  -  and 


will  throughout  -  be  assumed  to  have  sample  paths  in  D[0,1]  the  space  of 


A  •*,  •*-  ^ A  •*.  .*  < *  .  -  •  •'V-  .  . 


RwS<W 


•  ’  M  ?  M  m.  h '  *V 
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functions  on  [0,1]  which  sire  right  continuous  and  have  left  limits  at  each 
point.  Let  s=[0,l]xR,  with  »=[-«,  ®]\{0},  and  define  the  I  to  process  N  of 
juirpe  of  { n  (t)  >  by 

(2.10.2)  N(A)  =  #{t:  (t,  i>  (t)  -  n  (t-) )  £  A} 

for  Borel  sets  A  c  S,  where  r,  (t)  -  -n  (t-)  is  the  jump  of  n  ( * )  at  time  t. 

Then  N(A)  is  (measurable  and)  finite  a.s.  for  each  rectangle  A  such  that 
Ac  [0,1]  x  [-«,-€]  U[  £,©]  for  seme  £>0.  Hence  N  is  a  point  process,  and  in 
fact  it  is  a  Poisson  process  with  intensity  measure  v  which  is  the  product  of 
Lebesgue  measure  and  the  measure  v '  on  R  with  density  7+y~a“  1  for  y>0  and  7- 
|y|-a-i  for  y<0,  for  some  constants  7  ,  7-2  0  which  are  not  both 
zero  (i.e.  in  shorthand  notation,  dv  =  dtxdy  1  =  dt  x  (7+|  y  |  ~a_  1  dy) ) . 

Let  m(e)  =  0  for  0<a<l,  let  m(£)  »  /  y(l+y2)-1  dy '  (y)  for  a-1, 

e<|y| 

and  let  m(£)  =  /  ydv  •  (y)  for  l<a<2,  and  define 

«<|yl 

(0 

(2.10.3)  t)  =  U  ydN-tm(c). 

0£S<t 

«<  I  y  1 

Here  the  integral  is  just  a  finite  sum:  if  N  has  the  points  { (tj,yj) :  j>l) 

then  |  y j  |  >  e  and  Ost^sl  only  for  finitely  many  j's,  and 

//  ydN  =  E  yr 

0<s<t  j:  t.<t  and 

(<|yji  3<iy' 

With  this  notation 

(2.10.4)  P(  sup  b(t)  -*,<  <  >  (t)  |  >  5)  -»  0,  as  £->0, 

0<t<l 

for  any  5>0. 

Let  {f  }  1  be  arbitrary  randan  variables,  let  (an>0,  bn^n=i  ^  normin9 

00 

constants,  define  stochastic  processes:  {^(t):  0<t<l)n=1  in  D[0,1]  by 

[nt] 

an«j'bn>' 


(2.10.5) 


in(‘)  -  y')l  “  h(*)  -  i  (•)  i  in  D[0,1],  and  hence 
(O  (O 

P{  sup  I  r,  (t)-r,  (t)  I  >  3)  ->  P{  sip  I  n(t)-i)  (t)  I  >  3}  as  m®, 

o<t<l  n  n  0<t£l 


since  P(sup0<t<1  I  v  (tj-m (  f  1  (t)  |  =  3)  =  0,  for  3>0.  Now, 

[nt] 

(2.10.9)  *,n(t)  -  v(ne)(t)  =  ^  anaj_bn)(1-In,j)  + 

and  (2.10.8)  thus  foiled  immediately  frem  (2.10.4). 

Conversely,  suppose  (2.10.7)  and  (2.10.8)  hold.  The  map 

(O  (O  d  (0 

which  takes  Nn  into  is  a.s.  N-continuous,  and  hence  n  *  v  , 

as  m®  in  D[0,1],  and  together  with  (2.10.8),  (2.10.9)  this  implies  that 
i)  ^  *),  by  [16],  Theorem  4.2. 

The  main  condition,  Nn  ->  N,  of  "complete"  convergence  of  extremes, 
requires  much  weaker  asymptotic  mixing  conditions  than  those  needed  for 
convergence  of  sums  to  the  normal  distribution,  cf.  the  end  of  Section  2.4. 
However,  the  local  dependence  restrictions,  such  as  (2.2.2)  may  instead  be 
rather  restrictive,  and  are  not  even  in  general  satisfied  for  1-dependent 
processes,  cf.  Example  2.2.1. 

The  conditions  of  course  became  particularly  simple  when  £  ,  £ 2,... 
are  i.i.d.  Then  -»  N  is  equivalent  to  nP(an(£1~bn)  f  A)  -»  v '  (A) ,  for  each 
Borel  set  A  c  [-® ,  -  e  ]  U  [  e ,  ®]  for  scare  c>0,  which  in  turn  is  the  same  as 

CO 

(2.10.10)  nP(an(£1~bn)  >  x)  -»  7  +  /  y-a_1dy,  for  x  >  0, 

x 

X 

i“bn)  -  x)  ■*  7-  t  (yl’a"x  fy'  for  X  <  0, 

“CD 

as  rnos.  Another  way  of  expressing  (2.10.10)  is  to  say  that  the  marginal  d.f. 


F  of  the  £  's  should  belong  to  the  domain  of  attraction  of  the  type  II 
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distribution  for  both  maxima  (if  y+>0)  and  minima  (if  *y_>0) ,  with  the  same 

norming  constants  (an>0,  bn).  Furthermore,  Resnick  ([73])  shews  that 

(2.10.10)  actually  implies  also  (2.10.8)  for  i.i.d.  sequences.  Thus  in  this 
d 

case  ■»)  ■*  t)  in  D[0,1]  is  equivalent  to  (2.10.10).  It  may  also  be  noted  that 

b  can  be  taken  to  be  zero  here, 
n 

If  one  is  not  interested  in  full  convergence  in  D[0,1],  but  only  in 

n 

"marginal"  convergence  of  in(l)  =  £  an^j~^n^  a  non-norma^  stable 

distribution,  sufficient  conditions  are  easily  found  by  "projecting  onto  the 
y-axis".  Let  N'  be  the  point  process  of  jump  heights  of  -o,  given  by 
N'(A)  =  #{tf  [0,1] :  ii(t)  -  n  (t-)  f  A]  =  N(  [0, 1]  X  A) 
for  Borel  sets  A  c  R,  so  that  N'  is  a  Poisson  process  with  intensity  v '  and 
similarly  let 

NjIj(A)  =  #{j  <  [l,n] :  a^  -  bn)  c  A)  =  Nn([0,l]  x  A) . 

By  the  same  considerations  as  in  the  last  part  of  the  proof  of  Theorem  2.10.1, 


(2.10.11) 


N '  $  N  , 


as  w®,  in  R, 


and  if,  as  before  with  I  .  =  lif|a(£.-b)|>e  and  I  .=0  otherwise, 
n  n,D  nv  ■}  n'  n,] 

limsupPd  an(?j-bn)  (1-1^)  +  m(e) I  >  s )  •*  0,  as  e  ■»  0, 

a 

for  each  5>0,  then  nn (1)  °  ^(1)  in  R.  Moreover,  it  can  be  seen  that 
(2.10.11)  holds  if  and  only  if  the  joint  distribution  of  the  k  largest  and  k 
smallest  order  statistics  of  f . , . . , , £  tends  to  the  distribution  of  the  k 
largest  and  k  smallest  jumps  of  { n  (t) :  0<t<l)  for  each  k,  cf.  the 
introduction  to  this  section.  This  approach  to  convergence  of  E  an^j  -  bn> 
to  non-normal  stable  distributions  is,  with  same  variations,  pursued  in  detail 
for  i.i.d.  £ 's  by  LePage,  Woodroofe,  &  Zinn  [(56])  and  for  stationary 
sequences  satisfying  distributional  mixing  conditions  by  Davis  ([28]) . 


Finally,  the  results  of  this  section  easily  carry  over  to  non¬ 
stationary  situations  with  [nt]  replaced  by  an  arbitrary  time-scale,  to 

I 

convergence  of  rcw-sums  in  a  doubly  indexed  array  { £  . }  to  a  Levy 

(independent  increments)  process  without  continuous  component ,  bo  multi¬ 
dimensional  £ 's,  and  also  to  convergence  of  so-called  self-normalized  sums. 


2.11  Miscellanea 

(a)  Minima  and  maxima.  Since  the  minimum  mn=min{ £  , . . . ,  £^>  can  be  obtained 
as  %  =  y  ‘  *  *  ,_^n^  *  results  for  maxima  carry  directly  over  to  minima. 

In  particular  it  follows  from  the  Extremal  Types  Theorem  that,  under 
distributional  mixing  assumptions,  limiting  d.f. 's  of  linearly  normalized 
minima  must  be  of  the  form  l-G(-x)  where  G  is  an  extreme  value  d.f.  Further, 
it  is  trivial  to  see  that  for  i.i.d.  variables  minima  and  maxima  are 
asymptotically  independent  (cf.  [55],  p.  28). 

In  a  series  of  papers  ([23],  [24],  [27]),  R.  Davis  studies  the  joint 
distribution  of  and  Mn  for  stationary  sequences  (£n)  under  a  number  of 
different  dependence  restrictions.  Here  we  only  note  that  same  of  his  results 
alternatively  may  be  obtained  as  corollaries  of  the  multivariate  theory 
discussed  in  Section  2.9  by  making  the  identification  £  .  =£  . ,  £  .  =-£  . ,  so 

X  X  1  f  2  X 

that \l^'  V2='V  For  writing  u^2-vn,  for  Vun, 

the  mixing  condition  D(u  .  ,u  )  then  translates  to  a  ,  -»  0  for  same 

r*/ x  n,2  n f±  ^ 

sequence  1  =o(n) ,  with 

an,l  =  irt!X{  i  P(?  i  5  un'  £  i  2  vn:  i  «  A  U  B)  - 

p(^i  -  V  £i  >  vn:  i  e  A)  P(£j_  <  un,  £j[  >  vn:  i  e  B)  j  } , 

where  the  maximum  is  taken  over  all  sets  Ac  (l,...,k),  Be  (k+1 , . . .  ,n) ,  for 

k=l, . . .  ,n-l .  Thus  if  this  holds  for  u  =x/a  +  b  and  v  =  y/c  +d  ,  for  all  x 

n  n  n  n  J  n  n 

and  y  it  follows  that  any  limiting  d.f.  of  (an(^-bn)f  ^"YT^n^  lnust  ^  of 


the  form  G(x,co) -G(x,-y)  where  G  is  a  bivariate  extreme  value  d.f. 
Furthermore  the  criterion  (2.9.10)  for  independence  of  componentwise  maxima, 
i.e.  here  for  asymptotic  independence  of  and  m^  translates  to 

[nA] 

lim  sup  l  (P(£  >  u  ,  %  .  <  v  )  +  P  <  v  ,  £  •  >  un)}  -»  0,  as  k-»®. 

mo  j=2  J  J 


(b)  Poisson  Limit  Theorems.  Although  somewhat  less  generally  formulated,  the 
Poisson  and  Compound  Poisson  limits  discussed  in  Section  2.4  amount  to 
convergence  of  point  processes  N  defined  from  a  triangular  array  { e  . : 
i=l,...,n,  n>l)  of  zero-one  variables ,  with  stationary  rows  £  ,...,{,  by 

li/  X  I*  f  li 


Nn(E)  =  E  £n  i' 
n  i:i/n£E  n,x 


for  Borel  subsets  E  of  (0,1].  Thus,  the  proof  of  the  Poisson  limit  for  e=l 

(see  [55],  Section  2.5)  is  easily  seen  to  shew  that  if  D^)  and  (2.2.1)  hold 

with  £  .  <  u  and  i[  .  >  u  replaced  by  £  .=0  and  f  .=1,  respectively,  then  N 

in  in  n,  1  n^  1  n 

converges  to  a  Poisson  process  with  intensity  r  if  and  only  if  nP(fn  ^1)  ■* 


t  . 


Conversely,  the  literature  contains  many  sufficient  conditions  for 

convergence,  which  may  be  applied  to  extremes  by  setting  f  .  equal  to  zero 

n,  l 

or  one  according  to  whether  <  urior^>un’  sets  of  such  conditions 

seem  particularly  useful  here.  For  the  first,  let  B  ■  be  the  <r -algebra 

H/  1 

generated  by  f  ,,...,£  . .  Then  the  relation 

3  n,l  n,i 

[nt] 


(2.11.1) 


l 

i=0 


E{f 


n,i+l' 


Bn,i> 


->  t  T 


as  m<», 


in  probability,  for  each  t£  (0,1]  is  sufficient  for  convergence  of  Nn  to  a 
Poisson  process  with  intensity  r  ([36],  [31]).  For  the  second  one,  which  is 
due  to  Berman  ([12],  [14]),  we  assume  that  each  row  has  been  extended  to  a 
doubly  infinite  sequence  •  •  • » fn  _1  fn  O'  £n  1  ** *  and  write  Sn,i 


for  the 
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<T -algebra  generated  by  (  . .  Then  the  relation 

n,l  n,i 


(2.11.1) 


^  Bn,i>  “  tT  33 


in  probability,  for  each  te  (0,1]  is  sufficient  for  convergence  of  Nn  to  a 

Poisson  process  with  intensity  r  ([36],  [31]).  For  the  second  one,  which  is 

due  to  Berman  ([12] ,  [14]) ,  we  assume  that  each  row  has  been  extended  to  a 

doubly  infinite  sequence  ,  f  .  ...  and  write  B  .  for  the 

n,u  n, ±,  n,  1 

<r-algebra  generated  by  . . .  ,  e  .  ,  e  ..  Berman's  result  is  that  if 

n ,  I*!  xi,  i 

nP(en  ^1)^7  and  if  there  exists  a  sequence  >n  of  integers,  with  7n=o(n) , 
such  that 

n  j"  p<‘n,r1'  «n/i=l)  ^  o, 

nP(tn,l-l|5n-1'n>  -  r-  m"- 

in  probability,  then  Nn  again  converges  to  a  Poisson  process  with  intensity  r . 

Neither  one  of  these  three  sets  of  conditions  imply  any  of  the  others, 
in  particular  they  are  not  necessary,  and  each  of  them  might  be  the  most 
convenient  one  in  same  situation.  Hcwever,  e.g.  for  normal  sequences  with  r^ 
log  n  ^  o  they  all  seem  to  lead  to  about  the  same  amount  of  work.  One  useful 
feature  of  (2.11.1)  is  that  it  also  directly  gives  rate  of  convergence 
results,  cf.  [77]. 


3.  Extremes  of  continuous 


rocesses. 


(•)  f  k.  I _•)  i 


«•)  l^a  •) 


-h’v-zv’.-v, 


may  then  be  simply  shown  as  in  [55,  Chapter  7]  (assuming  that  the  underlying 
probability  space  is  complete)  that  M(I)  =  sup{£(t):  t  c  I)  is  a  r.v.  for  any 
finite  interval  I  and,  in  particular,  so  is  M(T)  =  M([0,T]).  The  extremal 
types  theorem  may  be  proved  even  in  this  continuous  context,  showing  that, 
under  general  dependence  restrictions,  the  only  nondegenerate  limits  G  in 

(3.1.1)  P{aT(M(T)  -  b^)  <  x}  -*  G(x)  as  T  •*  «, 
are  the  three  classical  types. 

Though  the  general  result  requires  considerable  details  of  proof,  the 
method  involves  the  very  simple  observation  that  for  (any  convenient)  h  >  0 

(3.1.2)  M(nh)  =  max(f1,f2,...,i'n) 

where  {\=max  (£(t):  (i-l)h  <  t  <  ih).  Thus  if  (3.1.1)  holds  and  the 
(stationary)  sequence  (  ,(2>...  satisfies  D(u  )  for  each  then 

it  follows  from  the  discrete  parameter  Extremal  Types  result  (Theorem  2.1.2) 
that  G  must  be  one  of  the  extreme  value  types.  Hence  the  Extremal  Types 
Theorem  certainly  holds  for  strongly  mixing  stationary  processes  since  then 
the  sequence  (fn)  is  also  strongly  mixing  and  thus  trivially  satisfies  D(u^) . 
However  a  more  general  form  of  the  theorem  results  from  showing  that  the  D(u  ) 
condition  holds  for  the  t  ’s  when  the  f 's  satisfy  certain  conditions  -  in 
particular  a  continuous  version  C(u,p)  of  D(u^) .  In  fact  the  condition  C(uT) 
will  be  defined  in  terms  of  the  process  properties  only  at  "time  sampled" 
points  jcjj,  for  a  sampling  interval  q^o. 

The  are  of  course  maxima  of  %  (t)  in  fixed  intervals  of  length  h 
(e.g.  f^=M(h))  and  the  sampling  interval  q^  must  be  taken  small  enough  so 
that  these  are  well  approximated  by  the  maxima  at  the  sample  points  jq^.  A 
convenient  restriction  to  achieve  this  is  to  define  q(=q(u))  to  satisfy 

(3.1.3)  P{M(h)  >  u,  £  (jq)  <  u,  0  <  jq  s  h)  =  o(<l  (u) )  as  u  -»  cc. 

where  + (u)  is  a  function  which  will  later  be  taken  to  represent  the  tail  of 


s  •, 


the  distribution  of  M(h)  but  which  for  the  present  need  only  dominate 
P{  £ (0) >u}  i.e. 


(3.1.4)  P{£(0)  >  u}  =  o(rf(u)). 

In  the  following  definition  F.  .  (u)  will  be  written  for 

l’"  n 

Ft  t(u,...  u),  where  Ffc  t  (x1,...xn)  =  P{£(t1)  5  x1, . . .  £  (tn)  <  x^}. 

1* '  ’  n  1' ' *  n 

The  Condition  C(v^p)  will  be  said  to  hold  for  the  process  £  (t)  and 
the  family  of  constants  {u^,:  T  >  0),  with  respect  to  the  constants  q^,  -»  0  if 
for  any  points  s1<s2  ...  c  sp  <  t^...  <  t  ,  belonging  to  (kq^,:  0<  kq^  £T}  and 
satisfying  t  -s  2  i ,  we  have 


V-v<lv 


a 

T,y 


where  a  ->  0  for  same  family  "Y-  =  o(T)  as  T  •>  a>. 

Theorem  3.1.1.  (Extremal  Types  Theorem  for  stationary  processes)  With  the 
above  notation  suppose  that  (3.1.1)  holds  for  the  stationary  process  { £  (t) ), 
and  some  constants  a^,  bT  and  a  non-degenerate  G.  Suppose  also  that  4  (u)  is 
a  function  such  that  (3.1.4)  holds  and  T!  (Up)  is  bounded  for  uT=x/aT+b^,  for 
each  x.  If  C(uT)  holds  for  some  family  of  constants  c^^q^)  where  q=q(u) 
satisfies  (3.1.3)  then  G  must  be  one  of  the  three  classical  extreme  value 


types. 

Proof.  The  method  of  proof  is  to  take  an  arbitrary  sequence  of  points 
Tn  €  (nh,  (rtf-1) h],  write  vn=u^  and  then  relate  D(vn)  for  the  sequence 
fn  to  C(u^)  for  the  process  (£t).  This  is  achieved  by  approximating 
the  joint  distributions  of  the  by  corresponding  joint  distributions  of 
maxima  at  time  sampled  points  jq^.  Details  of  the  calculation  may  be  found  in 
[55,  Section  13.1]. 


VVWv 


A 


>VA-  v 


!-v~  a!*  .a  1  »--V- if.  a*,  jf.  f. +£*£*£*£* 


\  ■ 


3.2  Domains  of  attraction 

In  the  classical  theory  of  extremes  of  i.i.d.  sequences  the  type  of 

limiting  distribution  for  the  maximum  was  determined  by  the  asymptotic  form  of 

the  tail  of  the  distribution  of  This  remained  true  for  dependent 

stationary  cases  with  non-zero  extremal  index  since  the  limiting  type  was  that 

of  the  associated  independent  sequence.  For  continuous  parameter  processes 

however  it  is  clearly  the  tail  of  the  distribution  of  (in  view  of  (3.1.2)) 

rather  than  that  of  ^  which  determines  the  limiting  type.  More  specifically 

if  f  ,£  ...  are  i.i.d.  random  variables  with  the  same  distribution  as  {^(h) 

then  {f  }  is  called  the  independent  sequence  associated  with  If 

n  u 

the  £ sequence  has  extremal  index  0>O  then  any  asymptotic  distribution  for 
M(T)  is  of  the  same  type  as  that  for  =  max  {f^,...,fn).  Again  the  case 
0=1  is  of  special  interest  and  sufficient  conditions  may  be  given.  In 
particular  the  following  condition  (analogous  to  (2.2.2)  for  sequences)  is 
useful: 

The  Condition  c'  (u^,)  will  be  said  to  hold  for  the  process  { £  (t) }  and 
the  family  of  constants  (t^:  T>0)  with  respect  to  the  constants  (q=q(uT)  ->0) 
if 

lira  sup  (T/q)  £  P{f(0)>Ur,  Hjq)^}  ■*  u  as  0 

T->o  hcjqceT 

We  assume  also  as  needed  that  for  some  function  yt 
(3.2.1)  P{M(h)  >  u)  -  h;  (u)  as  u  ->  oo  for  0<h<5,  some  5  >  0. 

The  following  result  may  then  be  shown. 

Theorem  3,2.1.  Suppose  that  (3.2.1)  holds  for  some  function  i  and  let  {Up) 
be  a  family  of  constants  such  that  C(uT),  C' (Up)  hold  with  respect  to  a  family 

I 

(q(u)  )  of  constants  satisfying  (3.1.3)  with  h  in  C  (Up)  not  exceeding  5/2, 


where  5  is  from  (3.2.1)  .  Then  as  T  4  a> 

(3.2.2)  T  ^  (u^)  4  r  >  0 

if  and  only  if 

(3.2.3)  P{M(T)  £  t^)  4  e' T . 

Proof.  It  is  sufficient  to  show  that  (3.2.2)  and  (3.2.3)  are  equivalent  when 

T  is  replaced  by  any  sequence  T  €  (nh,  (rH-l)h]  and  ru,  by  v  =  ru,  .  But  it 

n  in 

is  readily  seen  that  P{M(nh)  <  v^}  -  P{M(Tn)  <  v^}  4  0  so  that  it  is 
sufficient  to  show  equivalence  of  the  relations  Tn  i  (v  )  4  r  and  P{M(nh)  £ 

V  "  e"- 

Now  the  sequence  (fn)  defined  as  in  (3.1.2)  satisfies  D(v^)  and  it 
follows  from  Lemma  2.1.1  that  for  fixed  k=l,2,...,  n'  =  [n/k] , 

P{M(nh)  <  v  )  -  P^{M(n'h)  £  v^}  4  0  as  n  4  ©. 

•  i 

A  further  approximation  of  P{M(n  h)  <  vn)  by  n  (vn)  may  be  obtained  by 
approximating  maxima  over  intervals  with  those  from  sampled  points  jq  ([55, 
Corollary  13.2.2])  and  the  desired  equivalences  follow  from  obvious  relations 
such  as 

1-  (r/k)  -  o(k  1)  <  lim  inf  P{M(n  h)  <v  }  £  lim  sup  P{M(n  h)  £  v  > 

rv*  03  m  co 

£  1  -  (r/k)  +  o(k_1) . 

This  result  has  the  following  immediate  corollary  linking  the 
asymptotic  distributional  properties  of  M(T)  to  those  of  the  maximum 

t  t  /  t 

I<n=mcix(f  , . . . ,  fn)  of  the  associated  independent  sequence  { i n } . 

Corollary  3.2.2  Let  (u^,)  be  a  family  of  constants  such  that  the  conditions  o 
Theorem  3.2.1  hold.  Let  0<p<l.  If 

(3.2.4)  P{M(T)  £  - 


p  as  T  4  co 
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(3.2.5)  P{1^  "  vn^  p  33  n  4  ®' 

with  Conversely  if  (3.2.5)  holds  for  same  sequence  (vn>  then  (3.2.4) 

holds  for  any  such  that  4  (u^)  ~  4  )  provided  the  conditions  of 

Theorem  3.2.1  hold. 

Proof.  This  follows  simply  from  Theorems  3.2.1  and  1.2.5  by  obvious 
identifications . 

It  may  be  seen  simply  from  this  hew  the  function  4  can  be  used  in  the 
classical  domain  of  attraction  criteria  to  determine  the  type  of  limiting 
distributions  G  in  (3.1.1)  for  M(T) .  In  this  for  an  extreme  value  d.f.  G  we 
write  D(G)  for  the  (classical)  domain  of  attraction  of  G,  i.e.  the  set  of  all 
d.f. 's  F  such  that  Fn(^an+bn)  a  G(x)  for  same  sequences  (an>0),  (bu). 

Corollary  3.2.3  Suppose  that  the  conditions  of  Theorem  3.2.1  hold  for  all 
families  of  the  form  Hj,  =  x/a^  +  bT  where  a^O  and  h^  are  given  constants  and 
that  M(T)  has  the  limiting  distribution  given  by  (3.1.1) .  Then 

(3.2.6)  h^  (u)  -  1  -  F(u)  as  u  co 

for  some  F  eD(G) .  Conversely  suppose  that  (3.2.1)  holds,  and  (3.2.6)  holds 

I  » 

for  some  F  e  D(G) .  Let  >  0,  k>n  be  constants  such  that 

Fn(x/an+b]!i)  a  G(x)  and  set  aT  =  a'[T/h]  /  bT  =  b'[T/h]  *  111611  holds 

provided  the  conditions  of  Theorem  3.2.1  hold  for  each  v^x/a^+b^. 

Proof.  This  follows  from  the  previous  corollary,  noting  that  if  F  is  the  d.f.  of 
the  associated  independent  sequence  { then  l-F(u)  =  P(M(h)  >  u}  -  h4  (u) . 

In  particular  if  4  (u)  satisfies  one  of  the  classical  domain  of 
attraction  criteria  when  substituted  for  l-F(u) ,  then  the  limiting 
distribution  for  M(T)  is  of  that  type.  Thus  4  (u)  plays  the  central  role  in 
determining  limiting  types  just  as  the  tail  l-F  does  in  the  discrete  case. 


3.3  Extremes  of  stationary  normal  processes 


In  this  section  we  briefly  indicate  how  the  results  apply  to  a 
stationary  normal  process  £  (t)  (assumed  standardized  to  have  zero  mean,  unit 
variances,  and  covariance  function  r(t)  satisfying 

(3.3.1)  r(t)  =  1  -  C;  t  i  “■  +  o(|  t|  a)  as  t  ■>  0 

for  some  OO,  0<aS2.  This  includes  all  the  mean-square  differentiable  cases 
(a=2)  and  a  wide  variety  of  cases  with  less  regular  sample  functions  (0<a<2) , 
such  as  the  Ornstein-Uhlenbeck  process  (a=l) .  It  may  be  shown  that  for  such 
a  process  (satisfying  (3.3.1))  that  a  function  t  (u)  satisfying  (3.2.1)  is 
given  by 

2. 

(3.3.2)  vMu)  =  CWa  Ha  (2m)-W2  e"U  /2, 

but  the  proof  involves  quite  intricate  computations  when  a <2  (and  indeed 
forms  the  main  part  of  the  total  discussion  of  the  asymptotic  behavior  of 
M(T) ) .  The  Ha  are  constants  whose  numerical  values  are  known  only  in  the 
cases  a=l,2  (H^l,  H2=ir~1///2)  .  The  ’'regular"  case  a=2  is  simpler  and  4  (u)  may 
then  be  alternatively  obtained  as  in  the  next  section. 

It  can  be  shown  using  the  Normal  Comparison  Lemma  (Theorem  2.5.1)  that 
the  (standard)  stationary  normal  process  £  (t)  satisfying  (3.3.1)  satisfies 
C(uT)  if  (Up)  -»  t  2  0  with  i  given  by  (3.3.2)  provided  that 

(3.3.3)  r(t)  log  t  ->  Oast-*  ®. 

To  show  this,  the  required  "sampling  intervals"  q(u)  sure  chosen  to  satisfy 

(3.1.3)  with  qu2/a  -*  0  sufficiently  slowly.  It  can  also  be  shown  quite 

I 

readily  that  C  (u^)  is  satisfied  under  the  same  conditions. 

The  function  <1  (u)  given  by  (3.3.2)  satisfies  the  domain  of  attraction 
criteria  for  the  Type  1  extreme  value  distribution  (e.g.  Theorem  1.2.4,  with 
t/=l-F)  .  Indeed  some  calculation  shows  that  (3.2.2)  holds  with  7=e  x, 
uT=x/aT+bT,  for 
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(3.3.4) 


=  (2  log  T) 


+  { ( (2~a)/2a)  log  log  T  +  log(Ci/a  Ha(2*)  7  2<  2  -« >  /  2a)  J/e^, 

Hence  Theorem  3.2.1  gives  the  following  result. 

Theorem  3.3.1.  Let  the  (standardized)  stationary  normal  process  £  (t)  have 
covariance  function  r(t)  satisfying  (3.3.1)  and  (3.3.3).  Then  P{aT(M(T)- 
fc>T)  <x}  -»  exp(-e  x)  as  T  -»  ® ,  where  aT  and  are  given  by  (3.3.4) . 

I 

This  result  was  obtained  by  Cramer  ([21])  for  the  case  a.  =  2  and  a 
somewhat  more  restrictive  condition  on  the  rate  of  decay  of  r(t)  as  t-»®.  The 
result  in  its  present  generality  was  obtained  by  Pickards  [69]  with  further 
subsequent  refinements  by  other  authors  (see  [55,  Chap.  12,  for  references]). 
In  particular  considerable  generality  is  afforded  by  the  family  of  covariances 
satisfying  (3.3.1) ,  and  the  requirement  r(t)  log  t  0  imposes  only  a  very 
mild  assumption  on  the  rate  of  convergence  of  r(t)  to  zero  as  t-)®. 


3.4  Finite  upcrossing  intensities,  and  point  processes  of  upcrossings. 

In  the  continuous  parameter  case  exceedances  of  a  level  typically 
occur  on  intervals  and  do  not  form  a  point  process.  Hcwever  a  natural  analog 
is  provided  by  the  upcrossings  (i.e.  points  where  excursions  above  a  level 
begin)  which  can  form  a  useful  point  process  for  discussing  extremal 
properties.  Further  in  many  cases  the  intensity  of  this  point  process 
provides  the  function  i  (u)  needed  for  the  determination  of  extremal  type. 
Before  proceeding  it  is  of  interest  to  note  that  an  alternative  to  discussing 
upcrossings  is  to  consider  the  amount  of  time  which  the  process  spends  above  a 
level.  This  approach,  used  by  Berman,  is  briefly  indicated  in  Section  3.7. 

Let  then  (as  before)  (£(t):  t>0)  be  stationary  with  a.s. 
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continuous  sample  functions,  and  continuous  one-dimens ional  d.f.  If  u  is  a 
constant,  £  (t)  is  said  to  have  an  upcrossing  of  u  at  tQ>0  if  for  some  £>0, 

£  (t)  5  u  in  (tQ-£,  tQ)  and  £  (t)  >  u  in  (tQ,  tQ  +  f).  (Hence  in  particular 
£(tQ)=u.)  Note  also  that  £  (t)  is  (a.s.)  not  identically  equal  to  u  in  any 
interval,  so  that  £(t)<u  at  (infinitely  many)  points  of  (tQ-£,  tQ)  and  £(t)>u 
at  infinitely  many  points  of  (tQ,  tQ+e) . 

Under  the  given  assumptions  the  number  Nu(I)  of  upcrossings  of  u  by 
£  (t)  in  an  interval  I  is  a  (possibly  infinite  valued)  r.v. .  If 
u(u)=£N  ((0,1))<«  then  N  (I)  <  a>  a.s.  for  bounded  I,  and  the  upcrossings  form 
a  stationary  point  process  Nu  with  intensity  parameter  u=u  (u) . 

For  stationary  normal  processes  satisfying  (3.3.1)  u  is  finite  when 
a=2  and  is  then  given  by  Rice's  Formula, 


(3.4.1) 


u(u)  =  (C/2)1/27r_1  e"^/2 


and  for  non-normal  processes  u  may  be  calculated  under  weak  conditions  as 


(3.4.2) 


u(u)  =  ;  z  p(u,z)  dz 
0 


where  p(u,z)  is  the  joint  density  of  £  (t)  and  its  (q.m.)  derivative  £  (t) 

In  fact  these  relations  can  be  shown  simply  since  u  (u)  =  lim  J  (u)  where 

ql  o  q 


(3.4.3) 


Jq(u)  =  q  aP{£(0)  <  u  <  £ (q) }  (q  >  0)  . 


Note  that  the  calculation  of  u  as  lim  Jg(u)  is  potentially  simple  since  Jq(u) 
depends  only  on  the  bivariate  distribution  of  £ (0)  and  £ (q) .  Under  general 
conditions  it  is  also  the  case,  when  u->*asq-»  0  in  a  suitably  coordinated 


way  that 
(3.4.4) 


(3.4.5) 


Jq(u)  -  u(u) 


P{M(q)  >  u)  =  o ( u (u) ) . 


It  then  follows  that  (3.1.3)  holds  if  ^(u)=u(u).  For 
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0  <  (qu)  x  P(£(0)  <  u,  £<q)  £  U,  M(q)  >  u) 

=  (qu)"1  [P{£(0)  <  u,  M(q)  >  u}  -P{£(0)  <  u<  £  (q) }  ] 

<  (qu)"1  [P{Nu(0,q)  >  1}  -  qJq]  <  (qu)"1  (qu-qJq) 

-  1  -  Jq/u  -»  0 

which  with  (3.4.5)  readily  gives  (3.1.3).  Also  (3.2.1)  is  often  satisfied  in 
regular  cases.  Under  such  conditions  it  thus  follows  that  +  (u)  may  be 
replaced  by  u  (u)  in  previous  results  such  as  Corollary  3.2.3.  (For  a  precise 
statement  of  conditions  see  [55,  Theorem  13.5.2]). 

Thus  the  intensity  u  (u)  can  provide  a  convenient  means  for 
determining  the  type  of  limiting  distribution  for  M(T) .  However  the  point 
process  of  upcrossings  has  further  interesting  properties  analogous  to  those 
for  exceedances  in  discrete  parameter  cases.  In  particular  a  Poisson  limiting 
distribution  may  be  obtained  after  suitable  time  normalization. 

Specifically  let  u^^  and  T  tend  to  infinity  in  such  a  way  that 
Tu(ut)-»t>0.  Define  a  normalized  point  process  NT  of  upcrossings  having  points  at 
t/T  when  £  has  an  upcrossing  of  u  at  t  i.e.  N*T(I)  =  # (upcrossings  of  by 
£  (t)  for  t/T  f  I).  Then  the  following  result  holds. 

Theorem  3.4.1  Suppose  that  the  conditions  of  Theorem  3.2.1  hold,  with 
i  (u)=u  (u) .  Then  converges  in  distribution  to  a  Poisson  Process  with 
intensity  r  as  T-»®.  This  in  particular  holds  for  the  stationary  normal 
processes  satisfying  (3.3.1)  with  a =2  and  (3.3.4). 

Similar  results  may  be  obtained  under  appropriate  conditions  for  the 
point  process  of  local  maxiira  of  height  at  least  u,  as  vho,  leading  in 
particular  to  the  asymptotic  distribution  of  (T) ,  the  k^1  largest  local 
maximum  in  [0,T].  Indeed  "complete  Poisson  convergence"  results  analogous  to 
those  indicated  for  sequences  in  Sections  1.3  and  2.4,  may  be  obtained  for  the 


"/■X  /  /  ’  '• 
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point  process  in  the  plane  consisting  of  the  locations  and  heights  of  the 
local  maxima,  (cf.  [55,  Sections  9.5  and  13.6]  for  details). 

Finally,  it  is  also  possible  to  obtain  Poisson  limits  in  cases  with 
irregular  sample  paths  when  u( u)=®  (e.g.  normal  with  0<a<2)  by  the  simple 
device  of  using  the  "  e  -upcrossings"  of  Pickards  [70]  in  lieu  of  ordinary 
upcrossings.  Specif ically,  for  given  e> 0,  £  (t)  has  an  e-upcrossing  of  the 
level  u  at  tQ  if  £  (t)  <  u  for  t  e  (tQ-f,  tQ) ,  and  £(t)>u  for  same 
tf  (t0,t0+Tj) ,  for  each  v>0,  so  that  clearly  the  number  of  e -upcrossings  in  a 
finite  interval  I  is  finite  (indeed  bounded  by  (m(I)/f)+  1  where  m(I)  is  the 
length  of  I) .  This  device  was  used  in  [69]  to  give  one  of  the  first  proofs  of 
Theorem  3.3.1. 

2 

3.5  X  -processes 

The  proofs  for  normal  processes  in  Section  3.3,  and  also  for  the 
sequence  case  (Section  2.5)  use  the  Normal  Comparison  Lemma  (Theorem  2.5.1)  in 
an  essential  way.  It  will  also  be  the  basis  for  the  present  section  on 
functions  {X  (t) )  of  stationary  d-dimensional  (d  >  2)  normal  processes 
i(t)=(£1(t),...,£d(t))  defined  as 

d 

(3.5.1)  X(t)  =  £  £2(t) . 

i=l  * 

We  shall  assume  that  the  components  are  standardized  to  have  mean  zero  and  the 
same  variance  one  -  here  this  is  a  real  restriction  and  not  just  a  question  of 

normalization  -  and  also  that  the  components  are  independent.  Then  X  (t)  has 

2  2 
a  X  -distribution  and  the  process  (X(t);  t>0)  is  called  a  X  -process  (with  d 

2 

degrees  of  freedom) .  Extremal  properties  of  X  -processes,  and  of  same 
related  functions  of  £(t),  have  been  studied  in  detail  by  Sharpe  ([80]), 
Aronwich  and  Adler  ([4],  [5]),  and  Lindgren  ([2],  [57],  [59]).  Here  we  will 


follow  the  "geometrical"  approach  of  [58],  and  use  the  fact  that  x  (t)  is  the 
radial  part  of  £(t)  to  find  the  asymptotic  double  exponential  distribution  of 
maxima  of  X (t) ,  referring  the  reader  to  [5]  for  results  on  minima.  However, 
we  will  indicate  hew  the  results  can  be  obtained  quite  smoothly  from  the 
general  theory  of  Section  3.4,  rather  than  by  using  Lindgren's  direct 
calculations. 

Now,  suppose  further  that  the  component  processes  { E  ^ (t) } ,  i=l,...,d 

are  continuously  differentiable  a.s. ,  and  have  the  same  covariance  function 

r(t) .  We  shall  presently  shew  that  u( u) ,  the  mean  number  of  u-upeross ings  by 

X  (t) ,  0<t<l,  is  easily  found  from  (3.4.2),  and  then  apply  theorem  3.4.1.  For 

1 

i=l,...,d,  E^(0)  and  E^  (0)  are  jointly  normal,  and  hence  independent,  since 
0°v(E.'(0),$.(0))  =  lim  F{h-1(E  .  (h)-£  .  (0) )  £  .  (0) }  =  r' (0)=0,  where  the 

h*0  ill 

last  equality  holds  because  r(t)  is  symmetric  around  zero.  Similarly,  if 

I 

is  the:  second  spectral  moment,  f1(t)  has  variance  \.  Thus  the 

•  •  .  i  ^  i  ^  p 

conditional  distribution  of  X  (0)  =  I  2  E  ■  (0)  E  (0)  given  X(0)=  l  E  •  (0)  =  u>0 

i=0  1  i  i=l  1 

d  2 

is  normal  with  mean  zero  and  variance  £  4  \  E  .  (0)=4Xu.  Let  p(z|  u)  be  the 

i=l  1 

density  of  this  conditional  distribution  and  let  p(u)  be  the  density  of 
X(0),  i.e. 

(3.5.2)  p  (u)  =  2"d/2r(d/2)_1  u^'V^2. 

Then,  using  (3.4.2),  it  follows  that 

CD 

u(u)  =  p (u)  ;  z  p(z|u)dz 
0 

=  2"(d'1)/2  r(d/2)_;L(X/7T)1/2  u^"1)/2  e_u/2, 
for  u  >  0.  For  u  fixed,  J  (u)=P(X(0)  <  u  <  X(q))/q  -»  u(u)  as  q->  0,  and 

'■d 

similarly  (3.4.4)  holds  also  when  u  -»  ®,  q  -»  0,  with  u1^2  q  ->  a>0  (cf.  [58, 


Lemma  3.5],  viz.  J  (u)  -  n(u)  as  q»0,  as  needed. 

Theorem  3.5.1  Let  £  (t)=(£  1(t) , . . . ,  ^(t) )  be  a  continuously  differentiable  d- 
dimensional  standardized  normal  process  with  independent  components  and  the 
same  covariance  function  r(t) ,  as  above.  Suppose  further  that  r(t)  log  t  •*  0 
as  t  co  and  that 

(3.5.3)  Tu(u^)  -»  t  as  T  -»  ®, 

★ 

and  let  be  the  point  process  of  upcrossings  of  by  {X(t/T):  tf[0,l]}. 
Then  Nt  converges  in  distribution  to  a  Poisson  process  with  intensity  r ,  and 
in  particular, 

(3.5.4)  P{max  X(t)  <  u_)  •*  e~T,  as  T  ■>  oo. 

0<t<T  1 

Proof.  We  shall  briefly  indicate  how  the  conditions  of  Theorem  3.4.1  can  be 
checked.  We  assume  that  d=2,  the  extension  to  d>2  being  straightforward .  The 
main  idea  in  [58]  is  to  introduce  the  normal  randcm  field  (X  e  (t) :  O£0<2tt, 
tjO),  where 

Xe(t)  =  S^tjcose  +  f 2 (fc)  sin0 

is  the  component  of  £(t)  in  the  direction  (cos0 ,  sin0 ) ,  and  to  note  that  then 

(3.5.5)  X(t)  =  sup  Xe(t)2 

0< 0<2n 

Thus  sup  X  (t)  =  sup  X  e  (t) 2,  and  it  follows  at  once  from 

0<t£h  0<tsh,  0<e<2m 

the  extremal  theory  for  normal  random  fields  that  (3.2.1)  holds,  for 
4(u)=u(u)  and  any  h>0,  see  [58,  Lemma  2.2].  As  noted  above  for  fixed  a>0 

(3.4.4)  holds  for  q  =  a/u1//2  and  since  clearly  P{£(0)  >  u)  =  o(u(u)),  (3.4.5) 
is  satisfied  and  thus  also  (3.1.3)  holds  with  4  (u)  =  a(u).  Thus  it  only 
remains  to  establish  C^)  and  C'  (u^) ,  for  an  arbitrary  h,  say  h=l,  and  with 
this  choice  of  q  =  q(u)  since  all  the  conditions  then  also  hold  if  a  is  taken 
to  tend  to  zero  sufficiently  slcwly  as  u  •*  ®. 


For  this  we  introduce  a 


further  sampling,  in  the  0 -direction,  given  by  a  parameter  r  =  r(u)  =  b/u 
with  b>0.  Let  x  (t)  =  max  (X^t);  i=0, . . . , [2ir/r] }.  Then,  by  (3.5.5)  and  an 
easy  geometrical  argument, 

(3.5.6)  X(t)  cos2r  <  Xr(t)2<  X(t), 

for  0<r<rt/2.  To  show  that  c'(u_)  holds  let  uT=(uT)1//2cos  r,  so  that  by 

(3.5.6) 

and  stationarity, 

(3.5.7)  £  I  P(X(0)>il,  X  ( jq)  >  u_) 

q  1<  jq<  (T 

<  -  l  P(X  (0)>vL,  x  (jq)  >  VL) 

q  l<jq<£T 

<  £  l  |P(X  (0)>ul,  X  (jq)  >  ul)  -  P(X r(0)>ul)P(Xr(jq)>u!]?)| 
ql<jq<fT 

+  £(T/q)2  P(Xr(0)  >  U^)2). 

It  is  readily  seen  that  Tu^u^)2)  •*  t  =  t  exp  (b2/2)  and  that  Xe(t)  has  mean 
zero  and  variance  one,  and  that  ] cov  (Xe(0),  Xe'(t))|  £  |  r(t) | ,  for  any 
e ,  e ' .  The  Normal  Comparison  Lemma  can  then  be  routinely  applied  to  show  that 
the  sum  on  the  righthand  side  of  (3.5.7)  tends  to  zero.  Further,  since 
u(u)/p(u)  -  (2\u/*)1/2,  by  (3.5.2)  and  since  J  (u)  u (u) ,  it  follows  from 
(3.5.6)  that 

f  (T/q) 2  P(Xr(0)  >  u^)2  <  (  (T/q) 2  P(X  (0)  >  (uj,)2)2 

=  £(Tu((v^)2))2  (P{ X  (0)  >  (uj,)2}/qu((i4)2))2 
-»  ( (  Teb^2)2  ( 8  X  tv  ) /a2 

I  # 

and  thus  C  (u^  is  satisfied. 

Next,  with  the  notation  of  0(1^,), 

(3.5.8)  |F  _  t  t  (V  “  F  s  (VFt.  t  (UT}I 


->  to'*  *>_  » 
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transform  techniques  and  the  Kolmogorov  differential  equations  (cf .  also  the  survey 
[17]),  Berman  [11]  exploits  the  regenerative  nature  of  stationary  diffusions, 
similarly  to  Section  2.6,  and  Davis  [25]  and  Berman  [13]  use  a  representation  of  the 
diffusion  in  terms  of  an  Ornstein-Uhlenbeck  process.  Here  we  shall  discuss  same 
aspects  of  Davis '  methods,  and  in  particular  state  his  main  result  (relation  3.6.6) 
below) . 

A  diffusion  process  {{  (t) ;  t  >  0}  can  be  specified  as  the  solution  of  a 
stochastic  differential  equation 


(3.6.1)  d£(t)  =  u(6(t))dt  +  <r($(t))dB(t)  , 

where  (B(t)  :  t>  0}  is  a  standard  Brcwnian  motion.  We  refer  to  [52]  for  the 
precise  definition,  and  for  the  properties  of  { £  (t) }  usedbelcw.  For  simplicity  we 
will  consider  a  somewhat  more  restrictive  situation  than  in  [52] ,  and  in  particular 


we  assume  that  { {  (t)  >  is  defined  on  seme  open,  possibly  infinite,  interval  1=  (^ ,  r2) 
and  that  u  and  <r  are  continuous,  with  <r>0  on  I. 

Let  (s(x) ;  xf  1}  be  a  solution  of  the  ordinary  differential  equation 
(3.6.2)  <r2(x)s  (X)  +  2u(x)s  (X)  =  0 

i.e.  let  it  have  the  form  s(x)=c1+c  exp  {-/^  (2u  (z)/<r2(z)  )dz)  dy,  with 

1  1  xo  xo 

c2  >  0 ,  c1  real  constants ,  for  some  point  x  f  I.  Then  s  is  strictly  increasing  and  by 


I 

Ito's  formula  nt=s($t)  satisfies  d^t=f  (^t)dBt,  for  f(x)=  s  (s  (x) )  <r  (s  (x) ) , 


i.e.  s  is  a  scale  function  and  { -n^:  t>  0}  is  the  diffusion  on  natural  scale.  The 
speed  measure ,  m,  corresponding  to  this  scale  function  then  has  density  1/f  (x) , 
i.e.  m(dx)  =  (1/f  (x)  )dx.  We  further  assume  that  the  speed  measure  is  finite,  |mj  = 

1 2  m(dx)  =  /^(l/f  (x)dx  <  m,  and  that  s(x)  -»  a>  a s  x->r2,  s(x)->-©  as  x-*^.  It  then 


follows  that  the  boundaries^,  r2are  inaccesible,  that  the  diffusion  is  recurrent, 


and  that  there  exists  a  stationary  distribution  so  that  { £  (t) )  becomes  a  stationary' 
process  if  f  (0)  is  given  this  distribution. 

The  Ornstein-Uhlenbeck  process,  which  will  be  denoted  by  { ^  (t)  )  here,  is 


.  - .  •  .••"S' 


the  stationary  diffusion  process  (3.6.1)  specified  by  I=R,  u(x)=x/2, 

<r(x)=l,  xcl.  For  the  present  purposes,  a  convenient  choice  of  scale  function 

x  2 

for  (£(t) }  is  s(x)  =  (27t)  ^  dy,  and  the  corresponding  speed  measure 

-1/2  — x? /2  ° 

is  m(dx)  =  (27i)  '  e  '  dx.  Further,  it  can  be  seen  that  (£(t)}  is  a 

—  j  1 1 

standardized  stationary  normal  process  with  covariance  function  r(t)=e  and 
that  s(x)  ~  (277)  1//2x  1  e^^2  =  (x<j>  (x) )  1,  as  X4m.  Hence,  Theorem  3.3.1  may 
be  applied  with  C=a=l  and  its  conclusion  can,  e.g.  by  a  simple  "subsequence 
argument"  be  written  as 

(3.6.3)  si®  |P(M(T)  <  u)  -e"T'/S^j  4  0  as  T  4  o, 
u>u0 

for  any  uq>0,  and  with  M(t)=sup(|  (t) :  0<t<T). 

The  main  additional  fact  needed  is  that  the  Ornste in-Uhl enbeck  process  on 
natural  scale  can,  by  a  change  of  time,  be  made  to  have  the  same  distribution  as 
H(t)}.  More  precisely,  ([25,  Theorems  2.1  and  2.2]),  there  exists  a  strictly 
increasing  random  function  (r  (t) :  t>0)  such  that  the  processes  {s({  (t) ) :  t2  0)  and 
{s(f  ( r  (t) ) ) :  0}  have  the  same  distribution,  and  which  satisfies 

(3.6.4)  T  1t  (T)  4  1/|  mj  as  T  4  a), 
almost  surely. 

As  in  Section  2.6  it  follows  easily  from  (3.6.3),  (3.6.4)  that 

-T/(s(u)|m|) 

(3.6.5)  sup  |P(M(t(T})  <  u)  -  e  |  4  0,  as  T  4  ®. 

u>u 

o 

Since  for  M(T)  =  sup{£(t):  0<t<T), 

P(M(T)  <  u)  =  P ( sup  (s(£(t)):  0<t<T}  £  s (u) ) 

=  P ( sup  { s  (  £  (  t (t) )  ) :  0<t<T)  <  s ( u ) ) 

=  P(M(t (T) )  <  s_ 1 (s(u)  )  )  , 

(3.6.5)  is  readily  seen  to  imply  the  main  result  of  [27],  that 

-T/(s(u) | m| ) 


(3.6.6) 


sup  |P(M(T)<u)-e 


4  0  as  T  4  co, 


for  any  uq  e  I  with  s  (u  )  >0 .  This  is  a  quite  explicit  description  of  M(T) ,  "as  the  maximum 
of  T  i.i.d.  random  variables  with  d.f.  G(u)  =  exp{-l/(s(u)  jm|  ) }",  and  in  particular 
domains  of  attraction  for  M(T)  cure  found  by  applying  the  classical  criteria  to  exp  {- 
l/(s(u)  |m  )}.  Finally,  as  for  Markov  chains,  the  hypothesis  of  stationarity  is  not 
essential,  (3.6.6)  holds  for  any  initial  distribution,  as  can  be  seen  e.g.  by  a  simple 
"coupling  argument". 

3.7  Miscellanea 

(a)  Moving  averages  of  stable  processes.  These  are  continuous  time  processes 
of  the  form  \  (t)  =  /  c(t-x)  df  (x) ,  with  { f  (x)  }  a  non-normal  stable  independent 
increments  process.  Their  extremal  behaviour,  which  is  similar  to  that  of  the 
corresponding  discrete  parameter  moving  average  (cf.  Section  2.7),  is  studied 
in  detail  in  [75] . 

(b)  Sample  path  properties.  As  mentioned  in  Section  2.7,  [75],  [76]  and  [29] 
also  study  the  asymptotic  distribution  of  sample  paths  near  extremes.  A 
different  approach  to  this  problem,  via  so-called  Slepian  model  processes,  has 
been  pursued  by  G.  Lindgren  in  a  series  of  papers,  cf.  the  survey  [60]  and  the 
references  therein. 

(c)  Extremal  properties  and  sojourn  times.  In  an  important  series  of  papers, 

T 

Berman  studies  "the  sojourn  of  ?  (t)  above  u",  defined  as  I+j,(u)=^  l(£(t)>u}dt, 

where  1{  •  >  is  the  indicator  function.  For  a  wide  variety  of  cases,  including 

2 

many  normal  processes,  x  -processes,  Markov  processes,  and  random  Fourier 
sums,  he  finds  the  asymptotic  form  of  the  distribution  of  I+j,(u)  as  xb®  for 
fixed  T,  and  as  u,T->c  in  a  coordinated  way.  Further,  he  uses  the  equivalence 
of  the  events  (M(T)>u)  and  (l+r(u)>0)  to  study  the  maximum  of  (£  (t) } .  This 

work  i  •:  r  ■  i  "wi';l  in  the  [  resent  journal  ([l2])  by  Borman  himself. 
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